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Chapter  I. 

Introduction 

Many  computer  centers  share  their  resources  through  some 
communication  linking.  For  economic  reasons  most  pairs  of 
computer  centers  are  not  directly  linked  so  a  message  or  file 
often  travels  over  several  links  to  get  to  its  destination. 

For  operational  reliability  most  source-destination  pairs 
are  connected  with  two  or  more  distinct  message  paths. 

The  choice  of  which  path  a  message  is  to  follow  is  a 
routing  decision.  This  decision  might  be  to  use  the  path 
with  the  smallest  number  of  links ,  provided  it  is  not  con¬ 
gested.  If  the  delays  resulting  from  queuing  at  the  computer 
centers  and  transmission  across  the  links  are  significant,  then 
the  routing  decision  might  be  to  use  the  path  with  the 
shortest  delay. 

These  routing  decisions  require  some  knowledge  of  the 
network.  In  centralized  routing  one  computer  center  (with 
perhaps  several  backup  centers)  takes  the  responsibility  of 
monitoring  the  network.  This  center  receives  the  status  of 
the  links  and  the  size  of  the  traffic  flow,  from  each  source 
to  each  destination.  It  sends  out  to  the  other  centers  their 
routing  instructions. 

In  distributed  routing,  each  center  determines  the  best 
route  based  on  the  status  of  the  adjacent  links  and  on  its 
neighbors'  estimates  of  their  distances  (number  of  links,  the 
delay,  or  whatever)  from  the  destination.  More  details  are 
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given  later. 

Our  basic  routing  criteria  is  that  the  routing  should 
minimize  a  network  cost  function.  The  specific  form  of  this 
function  is  given  in  the  next  section.  The  mean  delay  is  a 
common  example  of  the  network  cost. 

Iterating  a  routing  algorithm  should  lead  to  the  optimal 
routing  whenever  the  traffic  from  each  source  to  each  destina¬ 
tion  is  constant.  Gallager  [77]  gave  the  first  distributed 
routing  algorithm  that  satisfies  this  criteria.  Bertsekas 
[78]  and  Gafni  [79]  generalized  Gallager' s  algorithm  and  gave 
different  proofs  of  convergence. 

In  contrast  to  centralized  routine  these  distributed 
algorithms  are  often  guaranteed  to  converge  only  if  the  routing 
changes  are  small  at  each  iteration.  Presumably,  the  conver¬ 
gence  is  also  slower  than  that  of  centralized  algorithms.  In 
chapter  three  we  describe  a  distributed  routing  algorithm  in 
which  the  routing  changes  are  comparable  to  those  of  a  cen¬ 
tralized  routing  algorithm. 

In  order  to  maintain  a  good  routing  when  the  source- 
destination  traffic  slowly  changes,  the  routing  algorithm  is 
periodically  iterated.  An  algorithm  with  a  good  convergence 
rate  should  need  fewer  iterations  to  give  a  good  routing. 

If  the  traffic  changes  more  rapidly,  then  the  algorithm  is 
iterated  more  often.  In  the  extreme  case,  the  traffic  changes 
so  rapidly  that  the  algorithm  cannot  cope  with  it. 

Empirical  tests  with  a  specific  network  are  required  to 


determine  how  often  a  routing  algorithm  should  be  iterated. 
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Rudin  [80]  suggests  that  these  tests  should  take  into  account 
the  control  used  to  limit  the  traffic  into  the  network.  Under 
this  flow  control  a  good  routing  algorithm  allows  more  traffic 
to  enter  the  network. 

The  rest  of  this  chapter  details  our  routing  problem  and 
gives  a  simple  distributed  muting:  algorithm.  The  second 
chapter  gives  two  distributed  routing  algorithms  that  make  the 
quadratic  approximation  of  the  change  in  the  network  cost  non¬ 
positive.  This  means  that  the  flow  change  generated  is  likely 
to  make  the  cost  function  decrease.  The  third  chapter  gives 
a  class  of  routing  algorithms  in  which  the  network  control 
center  is  called  upon  to  determine  the  size  of  the  flow  change. 
The  third  chapter  also  shows  that  this  class  converges.  Appen¬ 
dix  B  illustrates  some  sample  behaarior  of  the  algorithms 
mentioned  in  the  thesis . 

1 . 1  The  Routing  Problem 

Let  .7  be  the  set  of  nodes  (switching  centers)  .  A  duplex 
communication  channel  between  nodes  i  and  j  is  interpreted  as 
a  pair  of  directed  links  (i,j)  and  (j,i).  Let  L  be  the  set  of 
directed  links. 

We  will  call  the  traffic  destined  for  node  k  commodity 
k  and  we  will  let  C  denote  the  set  of  commodities. 

The  instantaneous  description  of  the  network  consists  of 
the  number  of  bits  of  each  ecmmondity  travelling  on  each  link 
and  the  number  of  bits  of  each  commodity  waiting  at  each  node 
for  transmission  over  a  link.  This  instantaneous  description 
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is  of  little  use  for  routing  as  during  the  time  it  takes  a 
node  to  describe  to  its  neighbors  the  state  of  its  queues,  the 
queue  lengths  usually  change  significantly. 

A  less  volatile  description  of  the  network  is  the  average 
rate  over  a  given  time  interval  at  which  each  commodity  travels 
over  each  link  and  the  average  rate  at  which  each  queue  length 
changes.  If  the  time  interval  is  large  enough  then,  because 
the  queue  lengths  are  finite,  the  latter  rates  will  be  small 
compared  to  the  former.  For  example,  in  ten  seconds  500,000 
bits  might  be  sent  over  a  link  while  the  queue  length  at  the 
head  of  the  link  changes  by  5000  bits.  The  average  rate  of 
change  of  500  bits  per  second  in  the  queue  is  small  compared 
to  average  transmission  rate  of  50,000  bits  per  second  on  the 
link.  Consequently,  we  will  treat  the  rates  at  which  the  queue 
lengths  change  as  zero. 

Let  be  the  average  rate  at  which  commodity  k  travels 
over  link  (i,j).  F^  is  non-negative.  Let  f^j  be  the  aggre- 
gate  flow  on  link  (i,j),  i.e.,  f . .  =1  F. ..  Let  c..  be  the 

1J  K  1J  XJ 

capacity  of  link  (i,j).  Let  R^  be  the  average  rate  at  which 
commodity  k  originates  at  node  i.  is  non-negative  for 

i^k.  The  consumption  of  commodity  k  at  node  k  implies 

The  conservation  of  each  commodity  at  each  node  and  our 
treatment  of  the  queue  lengths  as  zero  gives 


5 .Fx .  =Z,F^.  +  Rk 
j  i  ]  ll  l  l 


16  V  ,  ' 


(1.1.2) 
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The  network  cost  will  be  taken  to  be  the  mean  transmission 
cost 


1 

r 


(1.1.3) 


where 

r  Wit*.  R>t  H-1.4) 

tj_j(f4j)  might  be  1,  representing  the  unit  use  of  link  (i,j) 
in  which  case  the  network  cost  is  the  mean  hop  length.  Alter¬ 
nately,  might  be  (c^  -  f^)-1  which  is  a  standard 

approximate  formula  for  the  mean  delay  on  link  (i,j)  in  a  store 
and  forward  network  [Cantor  and  Gerla  74]  .  We  will  write  the  network 
cost  as 


J(f) 


'ifW 


(1.1.5) 


The  link  cost  J^j(f^j)  is  a  function  of  the  flow  on  that  link. 
Our  routing  problem  formulation  is 


minimize  Z-  .J. . (f. .) 

il  il  il 


subject  to  f .  .  =  E,  F  .  . 

J  il  k  ii 


V  —  y  —  pk 

Vim  lnni  Ri 


f  .  .  <  c  .  . 

il  -  il 


F.  .  >  0 
il  ~ 

( i ,  j )  e  I ,  ks  J 


(1.1.6) 
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The  set  of  feasible  flows  is  defined  to  be  the  set  of  varia¬ 
bles  f,F  that  satisfies  the  constraints  of  (1.1.6).  This 
thesis  assumes  that  on  the  set  of  feasible  flows  the  network 
cost  is  twice  continuously  differentiable  with  positive  partial 
derivatives  and  non-negative  second  partial  derivatives. 

Almost  all  of  the  results  about  the  routing  alogrithms  in 
chapters  two  and  three  hold  only  when  the  input  flow  R  is 
constant.  Rather  than  preface  many  remarks  with  the  clause, 
"Assuming  R  is  constant...,"  we  here  assume  that  R  is  constant 
in  the  rest  of  the  thesis. 

As  stated  in  appendix  A,  the  optimal  flow  of  (1.1.6)  is 
the  one  in  which  the  flow  travels  along  the  path  of  the  least 
incremental  cost.  If  the  derivative  of  is  taken  to 

be  the  distance  of  the  link  (i,j)  then  the  optimal  flow  is  a 
shortest  distance  flow.  We  denote  this  link  distance  as 


’ij 


3Ji3(V 

3fij 


(i» j)e- 


(1.1.7) 


By  our  assumptions  about  the  network  cost  function  we  have 


g .  .  > 


( i ,  j  )  e  I 


(1.1.8) 


1 . 2  A  Distributed  Routing  Algorithm 

We  introduce  the  basic  details  of  distributed  routing  with 
the  following  routing  algorithm.  Recall  that  commodity  k  is 
the  flow  destined  for  node  k.  As  we  often  describe  what  is 
happening  one  commodity  at  a  time,  we  will  just  as  often  find 
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it  convenient  to  omit  the  superscript  k.  If  (i,j)e£  or 
(j,i)e;  then  we  say  that  j  is  a  neighbor  of  i. 

In  the  following  algorithm  every  node  has  a  favored 
neighbor  for  each  commodity.  Initially,  we  let  this  be  the 
neighbor  on  any  path  with  the  fewest  links  to  the  destination. 
For  each  commodity,  is  the  distance  from  i  to  the  destina¬ 
tion  via  the  favored  neighbors.  Each  node  i  selects  as  its 

new  favored  n  ghbor  the  neighbor  m  that  minimizes  g .  +V  . 

lm  m 

(g^m  is  given  in  (1.1.7)).  (We  will  later  show  that  the  new 
favored  neighbors  determine  a  tree  directed  into  the  destination.) 
The  proposed  flow  F'  is  the  one  that  travels  to  the  destination 
via  the  new  favored  neighbors.  The  new  flow  generated  is  that 
convex  combination  of  the  present  flow  and  the  proposed  flow 
that  minimizes  the  network  cost. 

The  following  describes  one  iteration  of  the  algorithm. 

We  assume  that  the  initial  flow  is  feasible. 

I.  The  following  steps  are  done  for  each  commodity. 

A.  The  destination  sends  the  signal  "v(jest  =  0"  to 
its  neighbors. 

B.  Each  node  i  waits  until  it  receives  V  from  its 

n 

favored  neighbor  n.  Then  node  i  sends  the  follow¬ 
ing  to  its  neighbors. 


V.  =  g.  +  V 
i  ^in  n 


C.  Each  node  i  waits  until  it  receives  from  every 
neighbor  j.  Then  it  waits  until  it  receives  F^ 
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wamtm 


from  every  neighbor  j  for  which  Vj  >  V^.  Node  i 
automatically  assumes  =  0  for  those  neighbors 

j  for  which  Vj  <_  V^.  Node  i  then  determines  its 
new  favored  neighbor  m  by 

m  =  arg-min  {g^+Vj} 

The  proposed  flow  is  F!  =  £„F!.  +  R.  and 

Fjj  =  0  if  j/m.  F^  is  sent  to  node  j  if 

V.  >  V. . 
i  3 

II.  Each  node  i  sends  the  set  { f !  . (=£,  F !^) }  to  every  node. 

1  j  K  1  j 

III.  Every  node  determines  the  y,  0  <_  y  <_  1,  that  minimizes 
J ( (1-y ) f+yf ' )  while  satisfying  (l-y)f+Yf'  <  c.  For 
each  commodity  and  link  (i,j)  the  new  flow  is 
(l-y)Fi;j  +  yF!.. 

In  that  last  step  a  central  node  could  receive  f',  determine  y, 
and  send  this  information  out  to  all  nodes.  This  would  reduce 
the  communication  overhead,  but  would  also  introduce  a  delay 
before  every  node  learns  the  value  of  y. 

We  now  briefly  check  that  the  above  algorithm  generates  a 
feasible  flow.  For  each  commodity  and  each  node  i  we  have 
for  the  n  and  m  of  step  B  and  C, 


V. 

i 


g .  + 

^in 


V_ 


>  g  .  + 

-  ’im 


V 


m 


> 


V 


m 


(1.2.1) 
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where  the  last  inequality  comes  from  (1.1.8).  Thus,  the  new 
favored  neighbors  determine  a  tree  rooted  into  the  destination. 
The  proposed  flow  travels  on  this  tree  and  satisfies  all  of  the 
constraints  of  (1.1.6)  except,  possibly,  f'  <  c.  Then,  since 
f  is  feasible,  any  flow  satisfying  [1-y) £  +  y£‘  £  c  with 
0  £  y  £  1  is  feasible.  This  shows  that  the  algorithm  is 
feasible. 

When  the  link  costs  depend  on  the  flow  and  the  network 
is  moderately  to  heavily  loaded,  the  optimal  flow  usually  has 
branches,  i.e. ,  multiple  paths.  In  this  situation  a  routing 
algorithm  should  not  only  indicate  which  path  the  flow  is  to 
be  shifted  to  but  also  how  much  of  the  flow  is  to  be  shifted. 
The  routing  algorithms  of  chapter  two  do  this  job. 

We  end  this  chapter  with  an  example.  We  start  with  the 
network  of  figure  1.2.1  and  use  only  one  commodity,  that  des¬ 
tined  for  node  d.  The  link  capacities  are  c^^  =  5.  The  input 
flows  are  R^  =  3 ,  R2  =  2,  (and  R^  =  -5).  The  sum  of  the  input 
flows  is  r  =  5.  The  link  cost  functions  are 


■W  *  5  •  5-fTj 


For  the  initial  flow  we  have  =  3,  F 


2d 


following 


(1.2.2) 

2 .  With  the 


3f 
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(5-f .  . 
13 


(1.2.3) 


we  have  for  the  initial  flow,  gld  =  1/4,  g2d  =  1/9 ,  g^2  = 

g21  =  1/25.  Applying  step  IB  of  the  algorithm  we  have  V ^  *  1/4, 


~  1/9-  Node  d  remains  the  favored  node  of  2.  Node  2 
becomes  the  favored  node  of  1  because  1/4  >  1/25  +  1/9.  The 
proposed  flow  is  F£2  =  3,  =5.  In  step  III,  the  optimal 

Y  will  be  found  to  be  .3232.  The  new  flow  turns  out  to  be  the 
optimal  flow  with  cost  .4178.  The  initial  flow  had  .4333  for 
its  network  cost.  If  each  node  had  minimized  its  delay  from 
the  destination  then  the  scaling  y  above  would  have  been  .4398 
and  the  resulting  network  cost  is  .4198. 
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Note  l.N.l.  Schwartz  and  Stern  [30]  summarizes  the  routing 
schemes  used  in  present  day  networks.  None  except  ARPANET 
uses  distributed  routing.  Many  do  have  distributed  communi¬ 
cation  for  notification  of  congestion.  In  response  to  conges¬ 
tion,  these  networks  change  the  routing  to  some  predetermined 
path.  None  of  the  networks  optimizes  the  network  delay.  The 
network  that  comes  closest  to  this  is  the  ARPANET  [McQuillan 
et  alia,  80] .  It  takes  the  present  delays  in  the  network, 
finds  the  paths  with  the  smallest  delay  (not  incremental  delay) 
and  uses  these  paths  ignoring  the  fact  that  the  new  routing 
would  change  the  link  delays. 

Note  l.N.2.  For  centralized  routing  there  are  many  techniques 
to  find  the  optimal  routing.  Among  them  are  Cantor  and  Gerla 
[74],  Frank  and  Chou  [71],  the  flow  deviation  method  [Fratta, 
Gerla,  and  Kleinrock  73]  and  Schwartz  and  Cheung  [76] .  The 
routing  algorithm  given  in  section  1.2  is  a  distributed 
approximation  of  the  flow  deviation  method.  In  the  exact  flow 
deviation  method  the  proposed  flow  F'  is  sent  down  the  path 
with  the  shortest  distance  wrt  g. 

Note  1 . N . 3 .  Agnew  [76]  indicates  that  the  difference  between 
using  the  paths  with  the  shortest  delay  and  using  the  paths 
with  the  shortest  incremental  delay  is  largest  at  medium 
leadings.  Rudin  [80]  indicates  that  the  effect  of  using 
frequent  routing  determinations  is  most  beneficial  at  medium 
loadings.  The  reason  is  that  at  low  loadings  the  flow  usually 


follow  the  path  with  the  fewest  links.  At  high  loadings,  the 
highest  cost  links  form  a  cut  set  and  the  routing  either  avoids 
the  cut  set  or  balances  the  flow  through  the  cut  set  so  that 
the  cost  of  the  links  are  about  the  same. 

Note  l.N.4.  Our  routing  criteria  is  that  the  routing  should 
become  optimal  if  the  source-destination  traffic  remains  con¬ 
stant.  Since  this  traffic  changes  one  might  try  to  improve  on 
the  routing  in  between  routing  determinations.  Rudin  [761 
reports  some  benefit  in  using  something  close  to  the  actual 
delay  to  the  neighbor  plus  the  reported  delay  from  the  neigh¬ 
bor  to  the  destination  in  determining  the  shortest  path.  One 
might  also  predict  the  interim  delay  as  a  function  of  the  flow 
sent  to  the  destination  by  means  the  second  derivative  methods 
of  chapter  two  in  this  thesis. 


Chapter  IX. 
Second  Order  Routing 
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The  routing  algorithms  of  this  chapter  generate  a  flow 
change  F  that  reduces  the  quadratic  approximation  of  the 
cost  change.  The  convergence  of  these  algorithms  is  covered  in 
the  next  chapter. 

2 . 1  Notation 

We  will  be  using  the  vector  version  of  formulation  (1.1.6) 
given  in  the  first  chapter  along  with  some  new  quantities. 

Let  N  be  the  number  of  nodes,  L  be  the  number  of  links,  and 

C  be  the  number  of  commodities.  From  section  1.1  we  have 

k  k 

C=*N.  The  commodity  flow  F  is  Lxl  and  the  input  flow  R  is 

Nxl.  For  the  most  part  we  work  with  one  commodity  so  it  will 

be  convenient  to  let  F  and  R  be  the  flow  of  a  fixed  commodity. 

The  destination  of  this  commodity  will  be  called  node  dest. 

Let  the  following  NxL  node-link  incidence  matrices  be 

defined. 

+  f  1  if  n=i 

En,lj  {  0  if  n*i 

[  1  if  n-j 

E  .  .  :=  <,  ne.V,  (i,j)el  (2.1.1) 

n'13  [  0  if  n*j 

En,ij  En,ij  ~  En,ij 

The  conservation  equation  (1.1.2)  is  then  E+F  *  E  F+R. 


Let  the  node  flow  T  be  defined  by 


T  : =  E+F 
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=  E  F  +  R 


(2.1.2) 


T  is  Nxl.  A  routing  variable  0  is  defined  to  be  any 
non-negative  LxN  matrix  that  satisfies 


Z  ■  \b  •  - 
IT  JO 


1  if  n=i^dest 
0  if  n^i,  i=dest,  or  n=dest 


(i/j)ei/  ne;V 
(2.1.3) 


Let  the  routing  fraction  $  be  any  (usually  unique)  routing 

variable  such  that  if  T.^0  then  <t>  ■  .  ■  =  F.  ./T.  . 

i  13,1  13  1 

If  c() ^ j  is  taken  to  be  the  row  of  0  that  corresponds  to 

link  (i,j),then  from  the  definition  of  a  routing  variable,  9 

is  non-zero  only  at  4 .  .  .  .  We  have  F..=0..T=0..  .T.. 

It  will  be  convenient  to  take  0. ,  to  be  the  element  0. .  • . 

13  13 ,  J- 

Then  F^  =  0^T^.  We  will  take  0^  to  be  the  column  of  0  that 
corresponds  to  node  i.  The  non-zero  elements  of  0iTi  are  the 
flows  that  leave  node  i. 


F  =  0T 


(2.1.4) 


Let  F  be  a  change  in  F.  Let  F*  *  F  +  F  be  the  new  flow 
after  making  the  change.  Let  T'  and  0’  correspond  to  F'. 

Let  T  *  T'  -  T  and  0=0'  -  0.  From  (2.1.2)  and  (2.1.4)  we 
have  the  basic  equations  involving  these  quantities.  As 
discussed  near  the  end  of  section  1.1,  R  is  assumed  to  be 
constant . 


T'  =  E  F'  =  E  F’  +  R 
T  =  E+F  =  E_F 


F  '  =  a  '  T  1 


(2.1.5) 

(2.1.6) 
(2.1.7) 
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F  =  o'T'  -  <1>T  (2.1.8) 

=  4>T  +  )T  +  <<»T  (2.1.9) 

The  last  equation  comes  from  the  expansion  of  <j>'  and  T' 
in  the  previous  equation. 

The  definition  of  ip  implies  that  a  change  in  any  routing 
variable  such  as  $  satisfies 


=  0  <j>ik  +  0ik  >  0  (i ,k) e  3  (2.1.10) 

If  >  0  we  say  that  j  is  a  downstream  neighbor  of  i. 

A  node  k  is  downstream  of  i  if  it  is  a  downstream  neighbor  of 
i  or  if  it  is  downstream  of  a  downstream  neighbor  of  i.  ^  is 
said  to  be  loop free  if  no  node  is  downstream  of  itself. 

Let  Q(f)  be  the  quadratic  appoximation  of  the  cost  differ¬ 
ence  J(f+f)  -  J(f)  . 


(2.1.11) 


(2.1.12) 


(2.1.13) 


By  our  assumption  about  the  network  cost,  stated  in 
(1.1.6),  g„  >  0  and  h^  _>  0. 

Letting  g  and  h  be  lxL  vectors  and  the  square  of  a  vector 
be  the  vector  of  squares,  (2.1.11)  may  be  rewritten  as 
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Q ( f )  :=  gf  +  \  hf2  (2.1.14) 

Because  Q(f)  is  quadratic  it  is  symmetric  about  its 
minimum.  Since  Q(0)  =  0,  if  f  minimizes  Q(f)  then  Q(2f)=0. 

We  are  not  able  .to  minimize  Q(f)  directly  with  a  distributed 
routing  algorithm.  When  we  upper  bounded  Q(f)  by  other  qua¬ 
dratics  and  minimized  those  bounds  the  only  solid  thing  we 
could  say  about  the  result  was  that  Q(f)  <_  0.  In  that  case, 
we  rather  have  Q(2f)  <_  0.  The  two  distributed  routing 
algorithms  of  this  chapter  generate  a  flow  change  f  such  that 
Q(2f)  <  0. 

We  have 

Q(2f)  =  2gf  +  2I..1 l-.f^ 

-  +  2iijhij<vij>2 

<  2,IkFk  ♦ 

=  2Zk(gFk  +  Ch(Fk)2)  (2.1.15) 

The  routing  algorithms  of  this  chapter  make  Q(2f)  non-posi- 
~  "2 

tive  by  making  gF  +  ChF  non-positive  for  every  commodity. 

2. 2  An  Algorithm  Using  Second  Derivatives 

As  we  shall  show  subsequently,  the  following  algorithm 

'2 

has  the  property  that  gF  +  ChF  is  non-positive.  For  the 
first  iteration  of  this  algorithm  we  assume  that  <t>  is  loopfree. 
For  each  iteration  the  following  steps  are  done  for  each 
commodity. 
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IA.  The  destination  sends  the  sianal  "G,  =0,  H,  =0, 

3  dest  dest 

Sdest=0"  to  its  nei<3hbors- 

IB.  When  each  node  i  receives  and  Hj  and  S  ^  from 

every  j  such  that  4^  >  0,  node  i  passes  the  follow¬ 
ing  values  to  its  neighbors 


IC.  When  each  node  i  receives  the  G,  H,  and  S  values  from 
every  neighbor  and  the  size  of  the  new  incoming  flow 
from  every  neighbor  k  such  that  Sk  >  S i  or 
4>ki  >  0  then  node  i  determines  the  new  node  flow 
Ti=^kFki  an<*  $  which  minimizes  the  following 


(2.2.1) 


such  that  E.$v.=0  and  .  >  0  and  4> .  .  =0  if 

J  *3  13  13  —  13 

S.  <  S.  and  4.  .  =  0.  The  new  flow  is  F !  .=  (<)>.  .  +4> .  . )  T ! 

1  —  3  J-3  13  13  13  1 

This  is  passed  down  to  each  node  j . 

The  node  distance  G^  is  the  average  distance  over  which  the 
input  flow  travels  to  reach  the  destination.  Note  2.N.2 
shows  that  G^  is  the  marginal  change  in  J  with  respect  to  R^ 
when  $  is  fixed.  The  watershed  distance  is  the  largest 
node  distance  at  node  i  or  downstream  of  node  i.  Its  only 
role  is  to  prevent  a  deadlock  from  occurring  in  the  algorithm. 
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i  is  assumed  to  be  loopfree  so  there  is  no  deadlock  in 
step  IB.  7o  make  sure  that  there  is  no  deadlock  in  IB  in 
the  next  iteration  we  must  make  sure  that  $'=$+$  is 
loopfree . 

If  either  (i)  <fc^j  >  0  or  (ii) (i,j)e&  and  >  Sj  then 

we  say  that  j  is  a  downhill  neighbor  of  i  or  that  i  is  an 
uphill  neighbor  of  j .  In  step  IC  node  i  receives  the  size 
F' .  from  every  uphill  neighbor  and  constrains  <j> .  .  =  0,  i.e. 
$ij+cpij  *  0,  if  j  is  not  a  downhill  neighbor.  We  say  that  k 
is  downhill  of  i  if  it  is  a  downhill  neighbor  of  i  or  is 
downhill  of  a  downhill  neighbor  of  i.  No  deadlock  occurs  in  IC 
if  no  node  is  downhill  of  itself.  Also  <i>|j  is  non-zero  only  if 
j  is  a  downhill  neighbor  of  i  so  if  no  node  is  downhill  of 
itself  then  <p '  is  loopfree.  Thus,  to  show  that  the  algorithm 
is  feasible  we  only  have  to  show  that  no  node  is  downhill  of 
itself . 


By  the  definition  of  downhill  and  the  definition  of  S 
(in  IB)  ,  if  j  is  a  downhill  neighbor  of  i  then  Sj  <_  with  equal¬ 
ity  only  if  j  is  a  downstream  neighbor  of  i.  Thus,  if  j  is 
downhill  of  i  then  Sj  <_  S ^  with  equality  only  if  j  is  down¬ 
stream  of  i.  Thus,  a  node  is  downhill  of  itself  only  if  it 
is  downstream  of  itself.  But  (6  is  loopfree  so  no  node  is 
downhill  of  itself.  This  proves  that  the  algorithm  is 
feasible. 

In  step  IC,  node  i  needs  to  know  before  it  can  deter¬ 
mine  ; .  Rather  than  have  node  i  take  the  time  necessary  to 
measure  T|  we  have  every  uphill  neighbor  k  send  the  size  F,^ 
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to  node  i.  Calculating  the  new  flow  is  also  useful  if 
several  iterations  of  the  algorithm  per  measurement  interval 
is  desired. 

We  mention  here  the  effect  Tj  has  on  the  optimal  . 

If  TV  is  large  then  the  squared  term  in  (2.2.1)  is  penalized 
so  the  optimal  is  small.  If  T|  is  small  then  the  optimal 
will  be  large. 

'2 

Remark  2.2.1.  Algorithm  I  makes  gF  + ChF  non-positive. 

Proof.  Algorithm  I  minimizes  (2.2.1)  which  is  given  in  step 
IC.  Thus,  any  change  from  the  optimal  <j>  to,  say,  \p  where  -p 
satisfies  the  constraints  of  IC,  is  a  non-descent  change  in 
(2.2.1) . 

SiCgij+Gj+CL(hij+Hj)i»ijTi1  (iij“iij  5  1°  (2.2.2) 

The  expression  in  the  brackets  is  the  gradient  of  (2.2.1). 

At  >p  =  0,  the  above  inequality  reduces  to 

tjWlj+GjHij  ♦  WjlkySjlJljti  <  o  (2.2.3) 

Multiplying  by  Tj  and  summing  over  i  gives 

(g+GE-)  <j>T'  +  CL(h+HE~)  ( 4>T  *  )  2  <_  0  (2.2.4) 

We  now  develop  some  expressions  for  G  and  H.  From  IB, 


Gi  * 


;  .  g .  .  i> .  .  + 

irir  13 


y  .g.< 
3  1 


In  vector  form,  this  is 


G  —  go  +  GE  p 


(2.2.5) 


(2.2.6) 


Subtracting  the  last  term,  gives 
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G(I-E  $)  =  g<p  (2.2.7) 

Let  the  following  be  defined. 

aN  :=  ( I-E~cJ  )  ~ 1  (2.2.8) 

Mote  2.N.1  shows  that  the  inverse  exists  and  that  its  terms 
are  non-negative  and  less  than  one.  0N  is  NxN.  9^  ^  can  be 
interpreted  as  the  fraction  of  that  appears  at  node  k. 
Multiplying  (2.2.7)  by  0N  gives 

G  =  g<j)9N  (2.2.9) 

That  is , 


G  =  E  ■  •  g  ■  ■  di  •  ■  0  • 
n  i,n 

Similarly,  for  H 


(2.2.10) 


H  =  E  •  -h.  .  dif  .  9  . 

n  i]  i]  i]  i,n 


(2.2.11) 


With  (2.2.9),  the  first  term  of  (2.2.4)  is 


(g+GE  )  <J>T '  =  (g+gbQ^E  )4>T' 


=  gd+ie^E')^’ 


(2.2.12) 


We  now  show  that  this  is  gF.  From  (2.1.9) 


F  =  0T  +  $T  +  jT 


=  jT'  +  i)T 


Putting  this  in  T  =  E  F  gives 


(2.2.13) 


Subtracting  the  last  term  from  both  sides  and  then  multiplying 


by  9jj  gives 

T  =  (2.2.14) 

Putting  this  in  (2.2.13)  gives 

F  =  OT'  +  4>0ne“<1)T  '  (2.2.15) 

=  (I+oQjjE-)  <0T'  (2.2.16) 

Using  this  in  (2.2.12)  gives 

(g+GE~) 0T'  =  gF  (2.2.17) 


Thus ,  (2.2.4)  becomes 

gF  +  CL  (h+HE~)  (<J)T ' )  2  <  0  (2.2.18) 

We  now  show  that  ChF  is  less  than  the  second  term  above. 

Using  (2.2.15) , 


ChF2  *  Ch[<j>T'+(j>eNE"bT'  ]  2 


=  Cl  .  .h.  .  [A  .  .T!+I  <*..©-  0  T'  ] 

ij  lj  i  mnfi3  i,n*mn  m 


(2.2.19) 


Since  Tj  „=0 ,  there  are  no  more  than  L  term  in  the  brackets, 
dest 

Upper  bounding  the  square  of  sums  with  L  times  the  sum  of 
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ChF  <  CLZ  •  .h,  _ _ _  .  . 

i]  i]  i:  i 


[  (4>  .  .T!  )  2  +  Z  1>2  -92  (d>  T'  )  2] 


mn"  13  1 ,n  mn  m 


CLZ  •  .h.  .  ($  .  .T!  )  2  +  CLZ  .  .  h .  .  Z  <h2-d2  (<p  T’)2 

13  13  13  1  13  13  mnyi3  i,n  *mn  m' 


CLhur>2  +  a^iijhyt^tiv;)2 


(2.2.20) 


From  the  fact  0  <  9.  <  1  we  have  0.  >07  .  Using  this 

—  1 , n  —  1 ,n  —  i,n 

and  the  fact  that  h^  is  non-negative  in  (2.2.11)  gives 


H  >  Z  •  .h.  .<J>2  .Q2 
n  -  13  13  13  i>n 


(2.2.21) 


Using  this  in  (2.2.20)  gives 

ChF2  <  CLh ( <fcT ' ) 2  +  CLZ„  H  (AT' ) 2 


mn  n  mn  m 


*  CLh(0T')2  +  CLHE*(0T')2 


=  CL(h+HE~)  (<*T '  )  2  (2.2.22) 

Using  this  in  (2.2.18)  then  gives  the  remark. 

The  next  remark  says  that  there  exists  a  flow  cost 
below  which  alogrithm  I  makes  the  network  cost  decrease 
monotone ly. 

Remark  2.2.2.  Suppose  that  the  second  derivative  of  each 
J  —  is  positive  and  that  the  initial  flow  f®  is  such  that, 
for  every  link  (i,j), 
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max 

f 


3  ^Vjjtf)  <  j(f°) 


„f2 

3  Ij 


< 


2  min 
f 


J(f) 


J(f°) 


(2.2.23) 


Then  algorithm  I  makes  the  network  cost  decrease  monotcnely. 
Proof.  Let  f  be  any  flow  such  that  J{f)  <_  J(f°)  and,  for  f, 
suppose  that  algorithm  I  generates  f.  Suppose  to  the  contrary 
that  J(f+f)  >  J (f ) .  Because  of  the  strict  inequality 
J(f+f)  >  J(f),  we  have  f?0.  With  remark  2.2.1  and  equations 
(2.1.15)  and  (2.1.14),  algorithm  I  is  seen  to  give 

gf  +  hf2  <  0  (2.2.24) 


By  the  first  assumption  of  the  remark  we  have  h^  >  0  for 
all  links  (i,j).  Since  we  also  have  f^O,  gf  is  negative. 

gf  <_  -hf2  <  0 

This  means  f  is  in  a  descent  direction.  Since  J  is  continuous, 
there  exists  an  a ,  0  <  a  <  1,  such  that  J(f+af)  =  J(f).  A 

Taylor  series  expansion  of  J(f+af)  gives 


J  (f+af ) 


J(f)  +  agf  + 


a 

T 


32J. 


Z  .  . 
13 


iiliiii 


*f? 
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(2.2.25) 


where  is  between  f^j  and  f^j+af^.  We  ^ave  J(f+£)  J(f  ). 

Then  (2.2.23)  allows  us  to  say 


f 


IWj,*  T"-1 
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i!iii 


!£ii 


3f 


=  2hij  (2.2.26) 

Using  this  inequality  in  (2.2.25)  leads  to 

0  =  J ( f+af )  -  J(f)  <  agf  +  a2hf2  (2.2.27) 

<•  ct  (gf+hf2) 

<  0  (2.2.28) 

This  is  0  <  0,  a  contradiction.  Therefore,  the  network  cost 
decreases  monotonely. 

2 . 3  An  Algorithm  Not  Using  Second  Derivatives 

Let  P  be  a  number  greater  than  max  h^j .  The  following 
algorithm  is  less  precise  them  the  previous  one  as  it  uses  P 
instead  of  the  whole  vector  h.  It  does  have  two  advantages. 

H  is  not  passed  through  the  network.  Also,  the  optimizing 
^  is  independent  of  T^,  avoiding  the  need  to  pass  the  size 
of  the  flow  change  through  the  network.  Node  i  simply  cal¬ 
culates  =  <t>j_  +  and  proportions  out  with  the  traffic 
that  comes  into  the  node. 

In  the  first  iteration  of  the  algorithm  we  assume  u  is 


loopfree . 


IIA.  The  destination  sends  the  signal  "G.  *.  =  0,  s.  =  0" 

dest  dest 

to  its  neighbors. 

IIB.  When  each  node  i  receives  Gj  and  Sj  from  every  j  such 
that  0^  >  0,  then  node  i  passes  the  following  values 
to  its  neighbors 

G .  =  Z  .  (g .  . +G  . )  <i) .  , 

S .  =  max{G, ,  max  S . } 

IIC.  when  each  node  i  receives  the  G  and  S  values  from 
every  neighbor  it  determines  the  0^  which  minimizes 
the  following 

Ej  (^ij+Gj)<fij  +  \  PCLNZj0^jTi  (2.3.1) 

such  that  I-0..=O,  0..+0..  >  0,  and  0..=O  if 
J 

S •  <  S .  and  0 . .=0. 

1-3  13 

The  new  flow  is  F! .  =  (0. .+0. .  )T!  where  T!  =  E,F'.  The  demon- 

lj  1J  Ij  x  IKK 

stration  that  this  algorithm  is  feasible  is  the  same  as  for 
the  previous  algorithm  except  that  we  do  not  have  to  worry 
about  a  deadlock  in  step  IIC. 

Remark  2.3.1.  Algorithm  II  makes  gF  +  PC ( F [  non-positive. 

-  2  -  ~  2 

Note  that  a  non-positive  gF+PC|F[  makes  gF+ChF  non-positive. 
Proof.  Using  the  same  argument  as  for  (2.2.3),  the  optimal  0^ 


satisfies 
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Y9ij+<V*ij  +  iPCM!:j»tjTi  i  o 

Multiplying  by  T|  and  then  summing  over  i  gives. 

(g+GE~)  <t>T*  +  jPCLNZi]  <J»i  1  2TiT|  <  0 
By  (2.2.17)  the  first  term  is  gF. 
gF  +  i  PCLNE  J  <$  .  |  2T.  T!  <0 

^  -LX  X  1 

For  any  variable  x,  let  x+  =  max{x,0)  and  x~  =  max{0,- 
We  have  x  =  x+-x  . 

We  have  >_  -  T~  and  _>  -  T~. 

both  of  these  inequalities  in  (2.3.4)  gives 

gF  +  j  PCLN| 0(T-T”) I2  <  0 

We  now  show  that  |f|2  <_  j  LN  |  <J>  (T-T- )  |2  and  this  in  the 
will  prove  the  remark.  From  (2.1.9), 

F  =  <j>T  +  rtT  +  $T 

Expanding  T  into  T+-T  gives 

F  =  <bT  -  <pT~  +  $T+  +  <J)T+  -  <j>T_ 

=  $(T-T')  +  $'T+  -  4>T~ 

From  this  we  get, 

F~  <  <t>~  (T-T-)  +  (J>T~ 

With  T~  <  E-F~, 

F~  <  -Ji"(T-t“)  +  $E-F- 


(2.3.2) 

(2.3.3) 

(2.3.4) 
{}  . 

Using 

(2.3.5) 

above 


2.3.6) 

2.3.7) 
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Subtracting  the  last  term  from  both  sides, 

( I-0E-)  F~  £  tp~  (T-T-)  (2.3.8) 

Let  the  following  be  defined. 

0L  :=  (I-iiE-)”1  (2.3.9) 

Note  2.N.1  shows  that  the  matrix  inverse  exists  and  that  its 

terms  are  non-negative  and  no  greater  than  one.  0^  is  L*L. 

9. .  is  the  fraction  of  0 _ R  that  appears  on  link  (i,j). 

i]  , mn  mn  m 

Multiplying  (2.3.8)  by  &L  gives 

F"  <  0  (T-T~)  (2.3.10) 

—  Li 

Elementwise,  this  is 


(2.3.14) 
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.  «■,-  ....  — 


BSE5!SS«! 


Using  this  in  (2.3.13)  gives 


|F“|2  <  N ( I  <fr~  (T  -T~ ) ) 2 
—  mn  inn  in  m 


With  E  A _ =  T  £  I  <p  I,  the  above  becomes 

n  mn  2  n 1  mn  1 

i;'i2  i?  «u*  mi<v*;>>2 


We  do  the  same  development  fop  F  .  From  (2.3.6), 


F+  <  0+(T-T‘)  +  <j>'T+ 

With  T+  <  E_F+, 

F+  <  4>+  (T-T-)  +  0'E_F+ 

Subtracting  the  last  term  from  both  sides, 
(!-<(> ’E“)F+  <  0+(T-T~) 


(2.3.15) 


(2.3.16) 


(2.3.17) 


(2.3.18) 


(2.3.19) 


Now  we  define  the  following 

:=  (I-^’E-)"1  (2.3.20) 

It  has  properties  similar  to  9^.  The  next  few  steps  parallel 
(2.3.9)  to  (2.3.15).  This  gives 

12-3-21> 

usl"9  Vm  •  I 

:F+j2  <  J  (£mJ*mnl  (Tm-Tj  )2  (  7  •  3.22) 

1  —  2  mn 1  mn 1  m  m 

Using  this  and  (2.3.16)  in  |Fj2  =  |F-t’j2  +  jF  |2  gives 
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Note  2.N.I.  The  routing  fraction  $  is  loopfree  if  there  does 

not  exist  a  loop  (n, ,n_ , . . . ,n_ ,n  . ,  =  n, )  on  which  _  >0 

1  2.  m  m+l  i  nini+l 

for  i  =  This  note  shows  that  if  the  routing  fraction 

is  loopfree  then  for  a  given  R  there  is  a  unique  F  and  T. 

From  EF=R  and  E=E+-E~  and  then  from  F=<j>T  and  T=E+F  comes 

R  =  E+F  -  E~F 

=  T  -  E_0T 

=  (I-E*0)T  (2.N1.1) 

If  I-E  $  is  invertible  then  T  will  be  unique.  We  show 
the  invert ibility .  We  have  E~$^  =  ^ 

(E'*'32,i  -  l2-N1-2> 

Inspection  of  this  leads  to  the  following  interpretation. 

(E  i>)^  .  is  the  fraction  of  R.  that  -ppears  at  node  j  after 

j  /  •*■  i 

—  N 

travelling  on  exactly  n  links.  Since  0  is  loopfree,  (E  0)‘=O. 

We  have 

(I-E~0) (I+E~0+ (E~0 ) 2  +  ...  +  (E~0 ) )  =  I  -  (E~0)N  =  I 

Since  the  product  on  the  left  hand  side  equals  the  identity 
matrix,  we  get 

(I-e'6)”1  =  I  +  E~0  +  (E"0)2  +  ...  +  (e”0)N_1  (2.N1.3) 

This  shows  that  (I-E  J>)  ^  exists  and  has  non-negative  terms. 
(2.N1.1)  becomes 


T 


(I-E”0)_1R 


(2.N1.4) 
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(I-E~$) “1 •  is  the  fraction  of  R.  that  appears  in  T .  .  Since 
there  is  no  looping,  the  terms  of  (I-E  $)  are  no  greater 
than  one.  We  have  ( I— E~ <J> )  ^  ^dest  equal  0  if  j^dest  and  1  if 
j=dest .  Using  (2.N1.4)  in  F=$T  gives 

F  =  (I-E_<jj)”1R  (2.N1.5) 


We  now  wish  to  show  that  I~cJ>E  is  invertible 


(I-0E-)  (!+<(>£  +(0E“)2+  ...  +(0E~)N) 


=  I  -  (0E")N+1 


=  I  -  ^(E“(j,)NE" 


=  I 


Therefore  I-(}»E  is  invertible  and 

(I-sE-)"1  =  I  +  <|>E-  +  ( 4>E_ )  2  +  ...  +  UE  )N  (2.N1.6) 

-  -  2  -  N-l  - 

=  I  +  $(I+E  $+(E  ib)  +  ...  +(E  0)  )E 

*  I  +  <p  <I-E~(j>)  _1e"  (2.N1.7) 

We  have  the  elementary  equation 

«  ( I-E~4> )  =  < I-d>E~ )  <J> 

Inverting  the  factors  in  the  parentheses  gives 

(I-©E_) -1i  =  ©(I-E-!))'1  (2.N1.8) 

Using  this  in  (2.N1.6)  gives  another  expression  for  F. 
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F  =  (I-0E-)  “Hr 


(2.N1.9) 


(I-oE~)  .  1  is  the  fraction  of  0  R  that  appears  in  F... 

13  ,mn  mn  m  13 

Therefore,  it  is  non-negative  and  no  greater  than  one.  From 
(2.N1.7)  we  have 


(I-0E 


) 

lj  ,mn 


I.  . 

ij  ,mn 


+ 


4>ij  (I-E  0) 


-1 

i,n 


(2.N1.10) 


The  last  term  makes  sense  as  all  of  the  flow  on  (m,n)  reaches 
node  n.  (I-E  0)”^  is  the  fraction  of  that  flow  that  reaches 
node  i.  0^  is  the  fraction  of  this  flow  that  goes  on  link 
(i,j).  We  now  develop  an  inequality  using  the  following  pro¬ 
perty  of  $  (from  (2.1.3)) 


f  1  if  i/dest 


E .0. .  *  { 

]  i] 


(2.N1.11) 


0  if  i=dest 


l.  .  (I-dE-)"^  mT,  «  1  +  E.  .0.  ■  (I-e"0)T1 
13  1 j ,mn  1 j  13  1 ,n 


=  1  +  l  (I-E_0)'1 
i^dest  ' 


<  1  +  l  1 

i^dest 


=  N 


(2.N1. 12) 


Mote  2.N.2.  The  node  distance  G  has  some 
ties.  This  is  the  node  distance  that  was 
Bertsekas  78 ,  and  Gafni  79].  From  (2.2.9) 


interesting  proper- 
used  in  [Gallager  77, 
we  have 
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G  =  g.$  ( I— E~ (j> )  _1  (2.H2.1) 

Comparing  this  with  (2. Ml. 5)  shows  that  the  fraction  of 

in  G_  is  the  same  as  the  fraction  of  R  in  F . .  The  most 
n  n  ij  . 

important  property  of  G  is.  GR=gF 

GR  =  g<j>  (I-E~i)  _1R  =  gF  (2.N2.2) 

From  remark  A. 2-1  in  appendix  A  we  have  the  following  error 
bound 

J(f)  -  J  •  <  gf  -  E,DkFk  (2.N2.3) 

min  —  k 

where  D  is  the  shortest  distance  vector  with  respect  to 
commodity  k  and  distance  g.  Since  gf  =  E^gF  we  may  use 
(2.N2.2)  to  get 

J<£>  -  J„in  i  £kGkRk  '  Ek°kRk 

=  Ek(Gk-Dk)Rk  (2.N2.4) 

Let  us  restrict  the  routing  problem  to  one  commodity  and  use 
(2.N1.5)  to  make  J (F)  a  function  of  $  and  R,  i.e. 

JM  <(> ,  R)  =  J(<f>  (I-E"(b)"1R)  (2.N2.5) 

We  will  differentiate  this  equation  to  see  how  a  small  change 
in  $  and  R  changes  J*.  We  will  need  the  differential  of 
(I-e"d)-1.  Let  Y  =  ( I-E~ <t>)  ~ 1 .  Then,  (I-E_$)Y  =  I.  Differen¬ 
tiating  this  gives  (I-E  o)dY  -E  d®Y  =  0.  Rearranging  this 
then  gives 


1 


d(I-E'j)'1  =  dY 


=  (I-E~0) _1E~d©Y 
=  (I-E~0)  _1E~d(J>  ( ) 
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Using  the  chain  rule  in  (2.N2.5)  gives 

dJ*  =  g  [d©  (I-E~0)  _1R  +  <t>  (I-e"<0) -1E~d<J>  (I-E~<J>)  ~1R 

+  0 (I-E_0) _1dR]  (2.N2.6) 

Using  (2.N1.4)  and  (2.N2.1), 

dJ*  =  gdbT  +  GE~d0T  +  GdR  (2.N2.7) 

We  have  the  constraints , 

Zjd0ij  =  0  d0ij  +  0. •  >  0  (i,j)eL 

R.  +  dR.  >0  for  i^dest ,dR ,  .  =  -7.,,  _  dR.  (2.N2.9) 

i  i  —  dest  ‘•l^dest  i 

Since  G^est=0'  (2.N2.6)  says  that  if  dR^  and  dRdest  are 
the  only  changes  in  ©  and  R  then  J*  (and  J)  changes  by  approx¬ 
imately  G^dR^  where  the  approximation  is  better  if  the  change 
is  small. 

We  end  this  note  with  two  inequalities.  The  first  is 

G.  <  N  max  g  „  (2.N2.10) 

i  —  mn 


This  follows  from  (2.N2.1), 
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G.  =  E  . .  g  . ,  $  . ,  (I-E~4>)  “ 1 , 
i  ]ky]k*]kl  3,1 


1  -jk(max 

mn  " 


'  (*a* 


£  N  max  g 
mn 


mn 


(2.N2.11) 


The  other  inequality  is 


N (G-D) R  >  (G-D)T 


(2.N2.12) 


To  get  this  inequality  we  start  with  (2.2.6)  which  is 
repeated  here 


G  *  (g+GE  ) 


(2.N2.13) 


We  have 


D  £  (g+DE  )  <t>  (2.N2.14) 

Taking  the  difference,  and  then  successively  multiplying 
by  E  <(>  gives 

G-D  £  (G-D) E-* 

>  (G-D)  (E~4>)  2 

>  (G-D)  (E-<p)  3 


>  (G-D)  (E  -9 


N-l 
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Summing  the  column  of  the  above  chain  of  inequality  and 
adding  G-D  =  G-D  gives 

N(G-D)  >_  (G-D)  (I+E_<j)+ 2  +  ...  +  (E-$)  ^_1) 

=  (G-D) (I-E~$) _1  (2.N2.15) 

Since  G.  „=0=D,  .  and  ( I-e'a) 71 , _ .=0  for  i^dest,  we  can 

Q6SX1  Q6St  1  ; 

multiply  (2.N2.15)  by  R  and  preserve  the  inequality. 

N  (G-D)  R  >_  (G-D)  (I-E~<p  )  _1R 

=  (G-D) T  (2.N2.16) 

This  proves  (2.N2.12) 


Note  2 . N . 3 .  Another  possibility  for  H  in  algorithm  I  is  the 
following 


H 


1/2 

dest 


0 


H1/2  =  (I  .h.  .<j)2  . )  1/2  +  £.H1/24>,. 
i  13  13  3  3  30 


(2.N3.1) 


1/2 

The  square  of  H.  is  used  in  step  IC.  This  note  will 

-  2 

prove  that  this  choice  of  H  makes  gF  +  ChF  non-positive.  If 
the  proof  of  remark  2.2.1  is  reviewed  it  will  be  seen  that  it 
will  be  enough  to  show  that  (2.2.21)  still  holds.  This  condi¬ 
tion  is 


2  2 

H  >  Z • -h. . i  . 9 • 

n  -  13  13  13  1  ,n 


(2.N3. 2) 


As  in  the  transformation  of  G  from  (2.2.5)  to  (2.2.10),  we 
have  for  (2.N3.1) 
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H1//2  =  Z  .  (Z  .h.  .  4>2  . )  1//2<i>. 
n  l  3  13^13  *i,n 


(2.N3.3) 


Squaring  both  sides  and  then  reducing  the  square  of  the  sum 
to  the  sum  of  square  gives  (2.N3.2). 

It  is  not  clear  whether  this  H  is  better  or  worse  than 
the  H  of  algorithm  I.  The  one  given  there  was  chosen  for  its 
simpler  computations. 


Note  2.N.4.  Figure  2.N.4.  gives  a  non-optimal  flow  for  which 
one  iteration  of  either  algorithm  of  this  chapter  could  fail 
to  make  the  cost  function  decrease.  However,  there  is  an 
improvement  in  <j> . 


Figure  2.N.4. 

Associated  with  each  link  (i,j)  in  the  figure  is 

There  is  an  input  flow  only  at  node  1.  It  is  destined  for 

node  d . 

In  applying  one  iteration  of  either  algorithm  I  or  II  to 
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the  figure  we  have  G2=4,  G3=l,  and  at  node  1,  g12+G2  =  5 
and  =  4.  Thus,  there  is  no  change  in  the  flow  at  node 

1  or  elsewhere.  Thus,  the  cost  function  does  not  change  even 
though  flow  in  the  figure  is  not  optimal.  The  optimal  flow  is 
on  the  path  (l,2.3,d). 

At  node  2,  g23+G3  =  ^  anc*  ^2d+Gd  =  4  so  bY  either  algorithm 
I  or  II  we  have  <$23=1.  So,  we  do  have  an  improvement  in  and 
in  the  next  iteration  the  flow  will  change  and  the  cost  function 
decrease . 


Chapter  III. 

Partially  Distributed  Routing 
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This  chapter  gives  a  class  of  loopfree  routing  algorithms 
and  shows  that  each  algorithm  in  this  class  converges  to  the 
optimal  flow.  Our  objective  is  to  provide  extensive  freedom 
in  choosing  the  parameters,  so  that  heuristics  may  be  taken 
advantage  of  without  jeopardizing  the  convergence  to  the  optimum 
cost. 

The  central  form  of  these  algorithms  is  to  use  a  distri¬ 
buted  procedure,  such  as  either  of  the  algorithms  of  chapter 
two,  to  determine  for  each  commodity  a  proposed  flow  change  F; 

A  central  node  then  receives  the  aggregate  flow  changes 
f  and  determines  the  scale  y  that  minimizes  J{f+yf)  over 

*v 

0  £  y  <_  1.  For  each  commodity,  the  new  flow  is  F+yF. 

Figure  3.1  illustrates  a  potential  problem  with  this  cen¬ 
tral  form.  In  the  figure  there  is  just  one  commodity,  that 
destined  for  node  d.  The  flows  F^  are  given  next  to  the  links. 

£  and  6  are  positive.  The  link  costs  are 


Figure  3.1 
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The  two  subnetworks  determined  by  nodes  l,2,d  and  3,4,d, 
respectively,  are  each  the  same  as  that  of  figure  1.2.1.  The 
situation  depicted  by  nodes  l,2,d  might  have  come  about  when 
the  input  flows  were  R2=3  and  R^=2.  Refering  to  the  text 
associated  with  figure  1.2.1  we  see  that  the  optimal  flows  are 
<$  =  .3232,  £  =  0,  F  =  3-.  3232,  F  2  =  .3232,  and  F2d  =  2.3232. 
As  the  routing  algorithms  (such  as  those  of  chapter  two)  require 
the  flow  to  be  loopfree,  node  1  must  wait  until  5=0  before  it 
can  send  any  flow  to  node  2.  As  5  is  on  a  long  path  node  2 
will  set  the  flow  changes  F^  =  -5  and  F2d  =  5.  In  the  next 
iteration  5  will  be  zero  if  Y  =  1. 

We  are  interested  in  routing  algorithms  that  make  large 
flow  changes  so  as  to  have  rapid  convergence  if  convergence 
exists.  These  algorithms  would  over-correct  the  situation  at 
node  3.  Thus  if  5  <  .3232  then  F34  >  .3232  -  5  and  if 
5  >  .3232  then  F34  <  .3232  -  <5.  (We  assume  that  in  the  presence 
of  the  flow  change  at  node  2  the  optimal  value  of  5  will  never 
be  found.)  If  the  overcorrection  is  large  enough  then  y  will 
not  be  one.  y  will  be  between  0  and  1.  In  the  next  iteration 
0  will  be  closer  to  .3232  and  5  will  be  smaller  but  not  zero.  If 
the  overcorrection  of  5  persists  in  every  iteration  then  -  will 
approach  zero  as  the  number  of  iterations  gets  large  but  never 
become  zero.  The  algorithm  would  then  converge  to  a  non-optimal 
flow. 

The  obvious  remedy  is  to  make  sure  overcorrections  do  not 
occur  but  we  do  not  wish  to  sacrifice  rapid  convergence.  What 
we  will  do  is  have  a  distributed  procedure  generate,  for  each 
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commodity,  two  flow  changes,  P  and  F,  where  F  is  the  normal 
sized  flow  change  and  F  is  a  small  flow  change  in  a  good 
enough  direction  such  that  J (f+F)  £  J(f).  The  flow  change  F 
will  be  carried  out  regardless  of  the  scaling.  A  central  node 
receives  the  aggregate  flow  changes  f  and  F  and  determines  the 
scale  y  that  minimizes  J ( f+F+y (f-F ) )  over  0  £  y  £  1.  For  each 
commodity  the  new  flow  will  be  F+f+y (F-F) .  This  is  the  form  of 
the  ciass  of  routing  algorithms  given  in  the  next  section. 

In  the  example,  the  algorithm  waits  until  £  is  small 
enough  (this  is  made  specific  in  the  next  section)  and  then 
sets  -  -£.  In  the  next  iteration  £  will  be  zero  and  this 
will  allow  node  1  to  send  flow  to  node  2. 


3. 1  A  Generalized  Algorithm  with  Scaling 

Let  f°  be  the  initial  loopfree  flow  and  F  be  the  set 
{ f | J (f )  £  J(f°) ,f=EkFk,EFk=Rk,Fk  >  0,  keC}.  We  assume  that  on 
F  the  network  cost  J  =  (f^j)  tw*ce  continuously  dif¬ 

ferentiable  with 


44 — =^—  >  0  and  - >  0  (i,j)e3 


(3.1.1) 


Jf-j 


Let  M,  B,  A,  and  X  be  constants  satisfying 


M  > 


max 
fe  F 


max 

(i, j)e£ 


3 


(3.1.2) 


M  >  0  (3.1.3) 

B  =  2MCLN  (3.1.4) 


A  >  X  >  0 


(3.1.5) 


(Recall  that  C  is  the  number  of  commodities,  L  the  number  of 
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links,  and  N  the  number  of  nodes.) 

The  following  outline  gives  the  steps  of  one  iteration. 
For  the  first  iteration  we  assume  that  i>  is  loopfree. 


Upstream  Stage.  For  each  commodity  the  following  steps  are  done. 

A.  The  destination  node  sends  its  neighbors  the  signal 

"w  =  o " 
wdest  u  • 

B.  Each  node  i  waits  until  it  receives  the  node  distance 
Wj  from  every  j  such  that  <J>^  >  0.  Then 

1.  Wij  =  g^+Wj  j :  i>i  j  >  0  Cgi  j  was  given  in  (1.1.7).) 

2.  is  chosen  arbitrarily  from  the  interval 

[  min  w.., 
i  j* .  . >o  •‘■J  J 
J  i] 

3.  Node  i  is  'in  loop  danger’  if,  for  some  j,  >  0 

and  either  (a)  W.  >  W.  or  (b)  j  is  in  loopdanger. 

J 

4 .  Node  i  sends  and  its  loopdanger  status  to  its 
neighbors . 

Downstream  Stage.  For  each  commodity  the  following  steps  are  done. 

A.  The  set  of  downhill  neighbors  z i  is  the  set  (j)  such 

that  either  (a)  >  0  or  (b)  ^  and  j  is  not  in 

loopdanger.  If  j  is  a  downhill  neighbor  of  i  then  i 

is  an  uphill  neighbor  of  j . 

B.  Each  node  i  waits  until  it  receives  the  flow  changes 
Fk^  and  F^i  from  all  of  its  uphill  neighbors.  Then 

h  -  ZA±  T  -  Vki 

is  aribtrarily  selected  such  that  (a)  A  >_ 
and  (b) 

Z  .V  ■  .  <6?  .  >  AE  .  . 

]  l]  13  -  3  i3 


2. 


0. 


holds  for  all  0  satisfying  £^0^  = 

3*  )  is  chosen  to  minimize  either 

U)  +  §■  or 


such  that 


(b)  lj0ij  =o  *  +  ♦ij  >  0 

(c)  0i;.  =  0  if  j^Zi 

4-  is  arbitrarily  chosen  such  that 


00  SJs'ij*1J  <  «L 


(b)  0^ j  =  0  if 


b .  .  +  0  .  .  *  0 
pij  V1J 


5<  Fij  =  (<tij+0ij)(Ti-TI)  +  *ijTi  -  Fij  (recall  that 
=  max {0, -TV}  and  tT  =  max  {TV  ,0}) 


6.  n  =  arg-min 


7.  Any  satisfying  the  following  is  a  'leak' 


(a)  0  <  F  <  b'1(v.,-v.  ) 

ij  —  ij  m 


(b)  0Aj  +  0i;j  =  0 


'ij 


if  F^  is  a  leak 


ij 


=  -I 


k/n  ~ ik 


if  j=n 


0  otherwise 


47 


9.  rla  -  (♦ijnlj)(Tl-T- )  +  r±j 

10.  Node  i  sends  and  F^  to  each  downhill  neighbor. 


III.  Central  Stage. 

A.  When  each  node  i  knows  the  flow  changes  F^j  and  F^ 
for  all  neighbors  j  and  commodities ,  it  computes  the 
aggregate  quantities  f^+f^  and  f^j-f^ j  ,  and  sends 
them  to  the  central  node. 

B.  When  the  central  node  receives  all  of  these  quantities 
it  determines  y  to  minimize 


ZijJij  <fij+fij+YCfij-yij>  > 

such  that  0  <  r  <  1.  This  is  sent  to  every  node. 

C.  The  new  flow  is  F*  =  F  +  F  +  y (F-F) .  The  new  routing 
fraction  is 


>*.  = 
JO 


F*./T*  if  T*  >  0 

xy  i  i 


$ij  +  if  T7  =  0 


Each  iteration  of  the  algorithm  generates  a  set  of  possible 
feasible  flows  F*  each  depending  on  the  choice  of  the  node 
distance  W,  the  weight  7,  and  the  routing  <p  of  the  node  flow 
increment.  Let  4(F)  be  this  set  after  one  iteration.  4(F) 
implicitly  depends  on  <j> ,  A,  X,  and  B. 

The  second  algorithm  of  chapter  two  may  be  used  to 
generate  s.  In  this  case  A  >  j  PCLN  >_  X.  The  first  algorithm 
of  chapter  two  may  also  be  used  if  there  exists  a  positive  X 
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such  that 

CLZj(hij+H.)d>?j  >  Alj^  (3.1.6) 

For  these  algorithms  is  equal  to  the  maximum  value  in  the 
interval 


[  ™in  fik' 


'ii1 


and  \p  equals  <j>  +  $  .  Algorithm  I  would  minimize  IIB3a'  while 
algorithm  II  minimizes  IIB3a. 

For  this  W  (=  G)  and  \p ,  condition  IIB4a  is  automatically 
satisfied.  To  see  this  note  that  if  IIB3a  is  minimized  then 


If  IIB3a'  is  minimized  then 

In  either  case,  z j j j  -  °*  So 


<  Z  ■  if  •  •  6  ■  • 
-  3  13^13 


Wi 


(3.1.7) 


The  watershed  distance  method  used  in  the  previous  chapter  just 
prevents  loops  from  developing,  which  in  turn  prevents  dead¬ 
locks  from  occurring  in  the  algorithm.  The  loopdanger  method 
of  algorithm  is  another  way  of  doing  this.  We  will  show  this 
shortly.  The  loopdanger  method  is  slightly  harder  to  analyze, 
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gives  a  more  restricted  set  of  downhill  neighbors,  but  also 
uses  boolean  numbers  "loopdanger  status”  rather  than  real 
numbers  S  in  the  communication  between  nodes.  (For  that 
matter  the  communication  of  watershed  distance  could  be  changed 
to  "S^  is  the  same  as  G^"  or  "S^  is  different  from  and  is  ..." 
This  requires  using  more  than  just  boolean  numbers  only  when 

W 

We  now  show  that  z  is  loopfree.  This  will  avoid  a  dead¬ 
lock  in  the  downstream  stage  and  also  make  0*  loopfree.  A 
loopfree  0*  avoids  a  deadlock  in  the  upstream  stage  of  the 
next  iteration. 

We  fix  our  terminology.  Node  j  is  a  downstream  neighbor 
of  i  if  0^  >  0.  Node  j  is  downstream  of  i  if  it  is  a 
downstream  neighbor  of  i  or  if  it  is  downstream  of 
a  downstream  neighbor  of  i.  Node  j  is  a  downhill  neighbor 
of  i  if  jeZ^.  Node  j  is  downhill  of  i  if  it  is  a  downhill 
neighbor  of  i  or  if  it  is  dovmhill  of  a  downhill  neighbor  of  i. 
Upstream  and  uphill  are  the  reverse  of  the  downstream  and  down¬ 
hill  relations. 

We  assume  that  0  is  loopfree.  That  is,  no  node  is  down¬ 
stream  of  itself.  VThen  a  node  is  in  loopdanger  its  uphill 
neighbors  are  just  its  upstream  neighbors.  A  node  in  loop- 
danger  is  uphill  of  itself  only  if  it  is  upstream  of  itself. 

But  0  is  loopfree.  So,  no  node  in  loopdanger  is  uphill  of 
itself.  If  j  is  a  downstream  neighbor  of  a  node  i  that  is  not  in 
loopdanger  then  £  W^.  If  j  is  a  downhill  but  not  down¬ 
stream  neighbor  of  i  then  W ^  <  g^  +  <_  W^.  Therefore, 
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such  that 


(3.1.6) 


For  these  algorithms  is  equal  to  the  maximum  value  in  the 
interval 


[  min  w.., 
fc:*ik>0 


7  .  y .  . 

‘j  i) 


♦ij> 


and  t(j  equals  ^  +  $  .  Algorithm  I  would  minimize  IIB3a'  while 
algorithm  II  minimizes  IIB3a. 

For  this  W  (=  G)  and  \p ,  condition  IIB4a  is  automatically 
satisfied.  To  see  this  note  that  if  IIB3a  is  minimized  then 


Z 


j 


< 


I 


j 


+ 


1 

7 


z  ■  V-  .  d) .  .  T  . 
3  i]?i]  i 


<  0 


If  IIB3a'  is  minimized  then 


*  I  Vij’ij'VV  i 0 

In  either  case,  1  0*  So 

3  3  13  ^13  13 


<  Z  •  W  •  •  <j>  ■  ■ 
-  3  13V13 


=  W. 
1 


(3.1.7) 


The  watershed  distance  method  used  in  the  previous  chapter  just 
prevents  loops  from  developing,  which  in  turn  prevents  dead¬ 
locks  from  occurring  in  the  algorithm.  The  loopdanger  method 
of  algorithm  ’  is  another  way  of  doing  this.  We  will  show  this 
shortly.  The  loopdanger  method  is  slightly  harder  to  analyze, 
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if  j  is  a  downhill  neighbor  of  i  then  with  equality 

only  if  j  is  a  downstream  neighbor  of  i.  A  node  not  in 
loopdanger  is  downhill  of  itself  only  if  it  is  downstream  of 
itself.  But  (J)  is  loopfree.  So,  no  node  not  in  loopdanger  is 
downhill  of  itself.  Thus,  altogether,  Z  is  loopfree. 


3 . 2  Convergence 

In  a  long  series  of  remarks  we  will  show  that  algorithm 
.4  converges  to  the  optimal  flow.  Some  of  these  remarks  assume 


A  >  B  >  4  X 


(3.2.1) 


even  though  the  remarks  could  be  established  with  different 

numerical  constants  if  this  were  not  assumed.  From  IIB2  one 

9E 

sees  that  there  is  no  loss  in  generality  in  increasing  A  to  be 
greater  than  B  and  degreasing  X  to  be  less  than  B/4. 

The  first  four  remarks  develop  an  inequality  between  gf 
and  j f | .  This  inequality  will  also  show  that  the  directional 
derivative  gf  is  non-positive. 


Remark  3.2.1  gF  <_  iV'MT-T*’) 

Proof.  gF  -Z  .g. .F. ..  Using  IBl  in  this  expression  gives, 
iJ  1J  ^-J 


=  z.  .V.  . F. .  -  Z .W .T. 
13  13  13  3  3  3 


=  W F  -  WT 


(3.2.2) 


Using  «T  =  F  in  IIB5  gives 


SI 


F  =  t>  (T-T~ )  +  ii/T+  -  $T_  (3.2.3) 

Using  this  in  (3.2.2)  gives 

gF  *  ^(T-T-)  +  W\p T+  -  W<p T*  -  WT 
Using  T  =  T+  -  T~  and  then  rearranging  the  terms  gives 

gF  =  !/<{>  (T-T“)  +  (Vi|>-W)T+  +  )  T~  (3.2.4) 


From  IB2  we  have  W  <_  W<p.  From  IIB4a,  yip  £  W.  Using  both  of 
these  in  (3.2.4)  qives  the  remark. 

Remark  3.2.2.  ^(T-T*)  <  -  A  j  <|>  (T-T~ )  j  2 

Proof.  By  using  the  same  argument  that  led  to  (2.2.3),  if  node 
i  minimizes  IIB3a  then 


l  ,W-  -Ip.  .  +  l  -V-  .47  .T.  <  0 
Alternately,  if  node  i  minimizes  IIB3a’  then 


(3.2.5) 


Z -W. ,4. •  +  Z -V.  .47 . (T.+T. )  <  0 
j  13^13  j  lj  13  1  i  — 


(3.2.6) 


From  IIB2b,  1  j ^>i j  '  We  also  have  Ti  -  Ti'‘Ti 

and  -  t”.  So,  from  the  above  two  cases,  (3.2.5) 

and  (3.2.6),  we  see  that 


Z .W .  .4. .  +  \Z .47 . (T.-T. )  <  0 
3  1313  313  l  1  — 


(3.2.7) 


Multiplying  this  by  T^-T^  and  then  summing  over  i  gives  the 
remark. 


Remark  3.2.3  |F|2  <  |  NLj$(T~T") |2 
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E5ES! 


Proof.  Equation  (3.2.3)  is  the  same  as  equation  (2.3.6)  with 
p  in  place  of  <t> ' .  If  this  substitution  is  maintained  in  the 
equations  subsequent  to  (2.3.6)  we  get  the  remark  from  (2.3.24). 

Remark  3.2.4.  |f|^  £  -  gf 

Proof.  Combining  the  previous  three  remarks, 

lF|2-~ITgF  (3.2.8) 

Since  |  f  [ 2  <_  C£k|FK|2  and  EKgFK  =  gf,  the  remark  follows. 

Comment.  This  remark  implies  that  the  directional  derivative 
gf  is  non-positive. 

The  following  four  remarks  follow  a  similar  development 
for  ?. 

Remark  3.2.5.  gF  _<  wJ(T-T  ) 

Proof.  Using  <t>T  =  F  in  IIB9  gives 

F  =  J(T-  T  ”)  +  'i/T  +  -  $T  ”  (3.2.9) 

The  proof  of  the  remark  is  the  same  as  for  remark  3.2.1  except 
with  F  in  place  of  F  and  (3.2.9)  in  place  of  (3.2.3). 

Remark  3.2.6.  V$(T-T  ")  <  -  B  |$“(T-T~) |2 
Proof.  Let  the  following  be  defined. 

K .  :=  min  V .  .  (3.2.10) 

i  ,  i] 


From  IIB8, 


w 


[ 


1 

4 

i 


—  + 


_  + 


Z  .  W .  .  <t> .  .  =  K  .  I  .  $  .  . 
3  13*13  l  3  ^ 

We  have 
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(3.2.11) 


-  + 


■  VjHi  -  ^l^ij 


*  Ki‘,j*ij  " 


(3.2.12) 


If  F.  •  is  a  leak  then,  from  IIBi,  .  =  <fc.  ..  In  this  case, 

J  •  J  •  J 

$.~T.  =  <j.  .T.  =  F.  .  .  Then  with  IIB7a,  if  F.  .  is  a  leak 


-i 


♦ijTi  i  B  <*ij-Ki> 


(3.2.13) 


If  F^  is  not  a  leak  then,  from  IIB8,  =  0  and  (3.2.13) 

still  holds.  Using  (3.2.13)  in  (3.2.12), 


-  “  BIj(nj)2Ti 

<  -  BZj (?ij)2(Ti-Ti‘) 


(3.2.14) 


Multiplying  this  by  -  T^~  and  summing  over  i  gives  the 
remark. 


Remark  3.2.7  ) F  j  2  <_  2NL  |  <j>  "  (T-T  ~)  |2 

Proof.  Equation  (3.2.9)  is  the  same  as  equation  (2.3.6)  except 
with  bar  quantities  in  place  of  tilde  quantities  and  with  <i>  in 
place  of  -j 1  .  If  this  substitution  is  maintained  in  the  equa¬ 


tions  subsequent  to  (2.3.6)  we  get  the  following  from  (2.3.23). 
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|F|2  <  7  (Tm*Tm~>)2 

—  2  mn  mn  m  m 


=  §(2E  d»  ‘  (T  -T  ")  )  2 
2  mn  mn  m  m 


<  2NLI  ($  ~(T  -T _"))2 
—  mn  mn  m  m 


=  2NL 1  $  " (T-T  ")  |2 


This  is  the  remark. 

Remark  3.2.8.  M|f|2  <_  -  gf 

Proof.  Combining  the  previous  three  remarks, 

| F | 2  <  -  2NLB_1gF  (3.2.15) 

With  (3.14),  which  is  B  -  2MCLN,  we  get 

MC | F | 2  <  -gF  (3.2.16) 

Since  |f|2  £  CEjjFk|2  and  L^gF^  *  gf,  the  remark  follows. 

Comment.  This  remark  implies  that  the  directional  derivative 
gf  is  non-positive. 

The  next  two  remarks  develop  an  inequality  between  the 
cost  change  and  the  directional  derivatives. 

If  f  +  f  +y(f-f)eF  then  with  (3.1.2), 

J(f+F+y(f-F) )  -  J(f)  <g(F+y(f-F))  +  j  M|F+y  (f-F)  |  2 

(3.2.17) 

The  following  remark  sidesteps  the  F  condition. 

Remark  3.2.9.  For  any  I,yr£  that  makes  RHS(3.2.17)  non-posi¬ 
tive  the  inequality  (3.2.17)  holds. 
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Proof.  Let  f  =  f  +  y(f-?).  If  the  remark  holds  then 
J(f+f)  -  J(f)  <  0.  Suppose  to  the  contrary  that  J(f+f)  - 

A 

J(f)  >  0  and  that  f  makes  RHS(3.2.17)  non-positive,  i.e. 
gf  +  ^  M |  f  |  ^  <_  0.  Because  of  the  former  supposition,  f?<0 . 
Then  gf  <  -  j  M|f|  <  0.  Because  gf  <  0  and  J  is  continuous 
there  exists  an  a,  0  <  a  <  1,  such  that  J(f+af)  =  J(f).  Then 
f+afeF  and 

J  (f+af )  -  J(f)  <  agf  +  i  Met 2  |  f  !  2 

<  agf  +  a^-  M |  f  |  2 
=  a  (gf+  ~  M|  f  |  2) 

<  0 

That  is,  J(f+af)  -  J(f)  <  0.  This  contradicts  our  selection 
of  a.  Therefore,  the  assumption  must  be  false. 

Let  the  following  be  defined. 

AJ  :=  min  J ( f +l+y ( f-T) )  -  J(f)  (3.2.18) 

0<y<_1 


A 1 J  :»  min  RHS(3.2.17) 

1  °iYi1 

=  min  {gf+yg(f-f)  +  i  M|  f+Y (f-f )  |  2}  (3.2.19) 

0<y<1 

Algorithm  .4  generates  AJ.  The  above  remark  implies  that  if 
A  ^ J  <_  0  then  AJ  <_  A^J. 

Remark  3.2.10.  AJ  <_  A  B  ^(gf+gf) 


t 
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Proof.  Using  |a+b|2  <_  2  j  a  |  ^  +  2jb[2, 

|f+y(f-?) |2  =  | (1-Y)f+Yf I2 

<2  u-y)  2|?| 2  +  2Y  |  f  |  2 

Multiplying  this  by  M  and  then  using  remarks  3.2.4  and  3.2.8, 

M | F+Y ( f-f ) | 2  <  -  2 (1-Y) 2gf  -  MCNLA_1Y2gf 
Using  3=  2MCNL  (from  (3.1.4)), 

M|F+Y(f-F)  I2  I  -  2 (1-Y ) 2gf  -  B(2A)_1Y2gf  (3.2.20) 

From  (3.2.19) , 

AjJ  <  gF  +  Ygf  -  ygl  +  i  M) f+Y (f-f) j 2 
Using  (3.2.20) , 

A^J  <_  gF  +  Ygf  -  ygf  -  (l-Y)2gF  -  B(4A)  *Y2gf 
2 

Expanding  (1-Y)  and  simplifying, 

A^J  <_  Ygf  +  Ygf  -  Y2gF  -  B(4A)  H2gf 
From  (3.2.1),  B(4A)"1  >_  1.  Thus, 

AlJ  <  Ygf  +  Ygf  -  B(4\)*1Y2gf  -  B(4A)'1Y2gf 

=  (Y  -  Y2)  (gF+gf)  (3.2.21) 

The  RHS  above  is  minimized  over  Y  at  y  =  2A/B.  This  y  is 
positive  and  less  than  one.  Using  it  in  the  above  aives 
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AjJ  <_  XB_1(g?+gf)  (3.2.22) 

Since  the  directional  derivatives,  gf  and  gf,  are  non-positive, 
A^J  is  ncn-positive .  Thus,  AJ  <_  A^J  and  the  remark  follows. 

The  next  remark  says  that  the  flow  change  is  bounded  by 
the  cost  change. 

Remark  3,2.11.  M|F+y(f-f)|2  <  -  |(B/X)2AJ 

Proof.  Using  ( 1-y) 2  <  1  <  B(4A)_1  and  y2  <  1  in  (3.2.20), 

M|  f?  +  y(f-F)  |2  <  -  B  (2X)  -1  (gf+gf )  (3.2.23) 

Using  the  previous  remark  in  the  above  gives  the  current  remark. 

We  repeat  the  definition  of  Ki  (from  (3.2.10))  and  make 
two  more  definitions. 


K.  :=  min  W.  .  (3.2.24) 

U.  :=  max  {W  |  <d  +  g  .  >  0}  (3.2.25) 

-e“i 

U!  :=  max  {U.  .  |  <p.  .  +  d>  >  0}  (3.2.26) 

i  7  1 J  1 J 

3*"i 

At  the  end  of  the  iteration  we  have 


max  {V.  •  |  .  >0  } 

je3.  13 


r  uj 

if 

y*l 

o 

i-3 

■* 

II 

o 

i 

1 

i  ui 

if 

y<l 

and  TJ^O 

(3.2.27) 


Since  jk  ^  +  4^  >  0  implies  +  <P^j  >  0 ,  we  have  U|  <  U^. 
In  the  following  four  remarks  it  will  be  shown  that  iu-K  I  is 
bounded  by  the  cost  change. 
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Remark  3.2.12.  |u ' -K | 2  <  2A2 | t(T+T+) | 2 

Proof.  We  consider  first  the  case  of  node  i  minimizing  IIB3a 
which  is 


l.W.  .<}>.  , 

3  i]  1] 


1  "2 
+  i  Z  .  V.  .  <(>.  .T. 

■  I  3  1]  1]  1 


(3.2.28) 


As  in  (2.2.2),  any  change  from  the  optimal  4>  to,  say,  \p  is 
a  non-descent  change  in  (3.2.37).  Thus, 


L  .  [W .  .  +  V .  .  <J) .  .  T .  ]  (^  .  .  ~ 0  .  - )  >  0 

3  L  13  i]vi]  iJ  vvi3  *13'  - 


(3.2.P9) 


The  bracketed  quantity  is  the  gradient  of  (3.2.28).  Let  k 
be  the  arg-min  of  (3.2.24)  and  u  be  the  arg-max  of  (3.2.26). 
Let  e  =  By  the  definition  of  u,  e  is  positive. 

Let  be  the  following 


$iu  -  e  if 

$i:j  if  u^j^k 

4>ik  +e  if  j=k 


(3.2.29)  becomes 


lui  *  ‘'iu*iaTi)|-e)  +  IV'ik*ikTlI«  -  0 


(3.2.30) 


Dividing  by  e  and  using  IIB2a, 

-  «i  +  Ki  +  Ai*ikiTi  i  0 

Rearranging  and  squaring, 


i  A2diiu!  ♦  r»lki>  v 


1  2A2^iu+iik)Ti 
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<  2A2  |  (pi  |  2T?  (3.2.31) 

For  the  case  of  node  i  minimizing  IIB3a'  the  corresponding 
argument  yields 

(U[-Ki)2  <  2A2Ui|2(Ti+Ti)2  (3.2.32) 

In  either  case, 

(Ul"Ki)2  ±  2A2 I I  2  max{T2  ,(Ti+Ti) 2} 

-  2A2|(J>i|2(Ti+T^)2 

=  2A2Ui(Ti+T^)  |  2  (3.2.33) 


(3.2.35) 
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2  2  2 
Since  8  <  A  <  2A  . 

2A2Ij02jT2  >  (Ui-Ki)2  (3.2.36) 

Case  2,  U|  *  U^.  Prom  (3.2.33),  the  above  inequality 
is  again  satisfied. 

Summing  (3.2.36)  over  i  then  gives  the  remark. 

Remark  3.2.14.  |  *  (T+T+)  |  2  <  Nl|  <J>  (T-t")  |  2 

Proof.  We  first  derive  an  expression  for  T  .  We  have- 

T~  <  e"f".  With  (2.3.7)  , 

T"  <  e"4)"(T-t")  +  E~<)>T_  (3.2.37) 


Substracting  the  last  term  from  both  sides  and  then  multiplying 
by  aN  (defined  in  (2.2.8)), 

T~  <  0NET(T-T")  (3.2.38) 

Elementwise,  this  is 


T.  <  l  9-  d>  (T  -T  ) 
l  -  mn  i,n¥mn'  m  m' 


c  £  A  (T  -T  ) 
—  mn^mn  m  m 


(3.2.39) 


We-  next  derive  a  similar  expression  for  T+.  Here  we  start 
with  T+  <_  E  F+  and  (2.3.17)  with  $  in  place  of  a'. 

T+  <  E*:>+(T-T~)  +  E"*T+  (3.2.40) 


We  define  the  following, 
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0^  :=  (I-E-^)'1  (3.2.41) 

This  has  properties  similar  to  0N  including  0  < 9^  n  <  1. 
Subtracting  the  last  term  of  (3.2.40)  from  both  sides  and 
multiplying  9^  gives 

T+  <  9"E_0+(T-T")  (3.2.42) 


Elementwise 

T*  <_  E 
<  E 


,  this  is 

9?  ,i)+  (T  -T~) 

mn  i,nrmn  m  m 

t>+  (T  -T~) 
mn  *mn  m  m 


=  y  E  |  <J>  |  (T  -T  j 

2.  mn  mn '  m  m 


We  now  prove  the  remark.  We  have  the  identity. 


(3.2.43) 


Both  and  T*  cannot  be  positive.  Using  this  fact  and  (3.2.39) 
and  (3.2.43)  in  the  last  term. 


Since  d>rnil=»0  when  m=dest,  there  are  no  more  than  L  terms  in  the 
RHS.  Squaring  both  sides  and  using  Cauchy's  inequality  gives 

Ui;j  (T.+T+)  )2  <  L(0ij(Ti-T‘))2  +  |i(T-T")  |2 

Summing  this  over  (i,j), 

!  D  (T+T+)  :2  <  L  |  (T-T~ )  j2  +  \  LE  .  .  $2  j  q>  (T-T- )  |2  (3.2.44) 
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subsequence  of  (f(m)}  that  converges  to  fc.  Since  remarks 
3.2.16  and  3.2.11  say  that  jf(m+l)  -  f(m)|  goes  to  zero,  the 
sequence  (f(m^+l)}  also  converges  to  fc.  Likewise  for 
{f(mi+2)}<  .  ..,  {f  (n^+N-1)  }. 

Let  gC  be  the  gradient  of  J(fc).  Let  Dc  be  the  shortest 

distance  wrt  gc  to  a  particular  destination.  In  the  next  five 

remarks  we  will  show  that  there  exists  an  m'  such  that  rru  >  m’ 

and  D?  >  g?.  +  D?  imply  F.,  (m.+N-l)  =  0.  Let  Sc  be  the  node 
3  3  ^  3  i 

neighborhood  of  Dc ,  i.e. 

(i,j)e  SC  if  and  only  if  =  g^j  +  D ^  (3.2.48) 

Let  s  be  the  smallest  miss  from  the  shortest  distance,  i.e. 

s  -  min  {g^  +  D?  -  D?  |(i,j)*SC}  (3.2.49) 


Let  g  .  „  be  defined  by 
min 


g  .  =  nan  min 

m  f e?  (i , j ) e; 


(3.2.50) 


Because  of  (3.1.1)  and  the  fact  that  F  is  closed  and  bounded, 

g  .  >0.  Let  e  be  any  number  satisfying 

Jmin 

mints, g  .  }  >  e  >  0  (3.2.51) 

min 

Let  m1  be  such  that  for  m^  >  m’  and  for  n  =  m^ ,m^+l , . . . ,m^+N-l 
the  following  holds. 


I  g  (n) 


(3.2.52a) 


U(n)  -  K (n) |  < 


6_ 

4 


(3.2.52b) 
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<  g .  .  - 

-  yi3 

gij 

+g  . 

y  13 

+  g  .  -  g  .  +  g  . 

^jm  ^jm  ^-jm 

-  y  13 

+  gc  ■ 

y  13 

+  i g .  — g?  1  +  g? 
iy3m  y3m'  yjm 

_  1  c 

1 

c 

1  Ci,  c 

<  |  g-g 

1  + 

g .  .  + 
y  13 

lg_g  1  gjm  ••• 

<  (N-l) | g-gc|  +  D? 


The  last  inequality  used  (3.2.52a). 


Remark  3.2.18.  If  the  present  iteration  satisfies  (3 


and  if  (i,j )&S  then 


W .  .  >  D?  +  •— 
13-1  4 


Proof. 


W.  .  -  q.  .  +  W. 
13  yi3  3 


-  gi3  +  K3 


Let  W.  =  K  .  Then 
3m  m 


W .  .  >  g.  .  +W . 
13  ~  ^13  3"» 


>  g .  .  +  g  .  +  K 

—  ^13  ’3m  m 


>  g .  .  +  g .  +  g  + 

-  ^13  ’3m  ymn 


'  gij 
rij'gu 

.__C|  x 


c  c 

g  .  .  +  g  .  -  g  .  + 

13  3m  ^3tn 

c  1 

C  1 

,  C 

y  13  1 

g  .  -g . 
y3m  ’3m1 

+  g  • 
y  jm 

c  1 

C  |  .  c 

i 3  ‘  |g'g 

1  y3m 

(3.2.56) 


.2.52a) 


(3.2.57) 


(3.2.58) 
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vij  1  - (N-l) |g-gCl  +  D?  +  s  (3.2.59) 

D?+s  appears  here  because  (i,j)^Jc  so  the  expression 
g^j+gjm+  ...  is  not  along  a  shortest  path  in  D?.  Using 
(3.2.51)  and  (3.2.52a)  leads  to  the  remark. 


Remark  3.2.19.  If  the  present  iteration  satisfies  (3.2.52a) 
then  the  tight  nodes  are  not  in  loopdanger. 

Proof.  We  proceed  by  induction.  The  destination  node  is  tri¬ 
vially  tight  and  not  in  loopdanger.  Suppose  that  i  is  tight  and  that 
Dj<  implies  j  is  not  in  loopdanger.  with  an  expansion  of 

similar  to  (3.2.58)  except  with  (i , j ) , ( j ,m) , . . .eSc  we  have 


(3.2.60) 


(3.2.62) 


* 
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(3.2.62)  used  (3.2.51).  IB3  then  says  that  node  i  is  not  in 
loopdanger . 

Remark  3.2.20.  If  the  present  iteration  satisfies  (3.2.52a) 
then  the  tight  nodes  remain  tight  in  the  next  iteration. 

Proof.  We  proceed  by  induction.  The  destination  node 
trivially  remains  tight  in  the  next  iteration.  Suppose  that 
node  i  is  tight  and  that  each  node  j  with  D?  <  D?  remains  tight 
in  the  next  iteration.  With  remark  3.2.17  we  have 


W .  <  Z  W .  0. 
i  —  m  inrim 


< 


(3.2.63) 


If  a  node  j  with  *  0  enters  Z i  then  from  IIAb, 

Vij  £  wi*  Using  (3.2.63),  <  D?  +  e/4.  Remark  3.2.18  then 

implies  (i,j)eSc.  Therefore,  jeZ^^  implies  (i,j)eS?.  If  <{>*^>0 
then  jeZ^  and  (i,j)eJ?  and  D?  <  D?.  From  our  supposition  j 
remains  tight  in  the  next  iteration.  Therefore,  node  i 
remains  tight. 


Remark  3.2.21.  If  the  present  iteration  satisfies  (3.2.52a&b) 
and  not  all  nodes  are  tight  then  a  node  will  become  tight  in 
the  next  iteration . 

Proof.  Let  i  =  arg-min  { D?  |j  is  not  tight).  Note  that 

with  this  selection  if  (i,m)e5c  then  m  is  tight.  We  first 

show  that  Z.  contains  some  m  for  which  (i,m)eZ'c.  If  q>.  >  0 
i  un 

Q 

for  such  an  m  then  meZ^.  Thus,  suppose  that  (i,m)^S  for 
5.  >0.  Then  remark  3.2.18  and  step  IB2  say 


al]  m  such,  that 
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Wi  -  Di  +  X  (3.2.64) 

If  m  is  such  that  (i,m)eSc  then  m  is  tight  and  remark  3.2.17 
says 

W.  <  D?  +  |  (3.2.65) 

im  —  i  4 

Thus,  <_  W^.  Since  remark  3.2.19  says  that  node  m  is  not 

in  loopdanger,  m  enters  2 

The  first  part  of  this  proof  has  shown  that  has  a  node 
m  such  that  (i,m)e5c.  Thus,  remark  3.2.17  says 

K.  <  Dc  +  %  (3.2.66) 

1  —  1  4 

With  (3.2.52b), 


<  | U-K |  +  D?  +  | 

<  D?  +  %■  (3.2.67) 

—  1  i. 

A  review  of  IIIC  and  (3.2.25)  shows  that 
max  1 <J>*  j  >  0} 

j 

<_  dJ  +  |  (3.2.68) 

Remark  3.2.18  then  implies  that  if  >  0  then  (i,j)eJc.  The 
previous  remark  says  that  tight  nodes  remain  tight  in  the  next 
iteration.  Thus,  node  i  will  be  tight  in  the  next  iteration. 


Remark  3.2.21.  In  each  flow  in  (f(m^+N-l)},  m^>  m 1 ,  all  nodes 
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are  tight. 

Proof.  With  the  previous  remark  at  least  one  node  (the  destin¬ 
ation)  is  tight  in  f(m^),  two  nodes  in  f(m^+l),...,  all  nodes 
in  f(m^-N-l). 

The  next  two  remarks  prove  the  convergence  of  { J (m) }  to 

Jmin. 

Remark  3.2.22.  For  any  cluster  point  fc  of  {f(m)}  and  for  any 
e  satisfying  (3.2.51)  there  is  a  subsequence  {f(mj)}  such  that 
every  commodity  in  each  f (m^ )  satisfies 

gF  -  DR  <_  Nre 

where  r  =  Z^Z±j(k H*. 

Proof.  We  use  {m^  }  *  {mi+Njmi  >  m'}.  has  an  expansion  in 

°i  =  «ij  +  *  ••• 

*  »ij  '  3ij  +  +  -  *jk  +  5jk  +  ••• 

-  +  9°j  '  |9'9C|  -  g°k  -  ... 

>  D?  -  (N-l) |g-gc | 

>  D?  -  t  (3.2.69) 

Since  i  is  tight  by  the  previous  remark,  we  have 


Z  ■  w .  .  •> .  .  <  D  . 

3  13  13  -  i 


e 

4 


i  Di 


+ 


€ 

7 


(3.2.70) 


The  last  inequality  used  (3.2.69). 


We  rewrite  the  above  as 


Multiplying  this  by  , 

V<j»iTi  -  DiTi  <  eT±  £  er 
Summing  over  i, 

V«j)T  -  DT  <_  Nre 
We  have 


(3.2.71) 


V<t)T-  DT  =  P/F  -  DT 


*  Eij(*ij*VFij  - 

*  gF  +ZjWj *Tj“Rj>  -  DT 

>  gF  +  D(T-R)  -  DT 


=  gF  “  DR  (3.2.72) 
This  with  (3.2.71)  gives  the  remark. 

Remark  3.2.23.  The  cluster  points  of  {f(m)>  are  optimal  flows. 
Proof.  Since  e  is  arbitrarily  small  in  the  previous  remark,  we 
have  for  each  cluster  point  fc  and  every  commodity 


Remark  A. 1.5  in  appendix  A  then  says  that  fc  is  optimal. 
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Note  3.N.I.  This  note  shows  that  if  X  >  -rB  3  4  MCLN  then 
the  algorithm  can1  safely  use  F+F  for  the  new  flow  F*  bypassing 
F  and  the  calculation  of  Y.  Let  this  simplified  algorithm  be 
j4  ’  ( f )  and  let  {f(m)}1  be  the  sequence  of  flows  generated  by 
A • ,  i.e.  f (m -l)e4' (f (m) ) . 


Remark  3.N1.1.  If  \  ^  MCLN  then  the  cluster  points  of 

{ f (m) } *  are  optimal  flows. 

Proof.  Remarks  3. 2. 1-4  still  hold.  Remark  3.2.4  simplifies 
into 


M  |  f  |  <  -  gf 


Remark  3.2.9  still  holds.  Let 
A’J  *  J(f+f)  -  J(f) 


AjJ  *  gf  +  j  M|f | 


(3.N1.1) 


(3.N1.2) 

(3.N1.3) 


Algorithm  A'  generates  A'J.  Using  y=l ,  remark  3.2.9  says 
that  if  A^J  is  non-positive  then  A'J  <_  A^J.  In  place  of 
remark  3.2.10  we  will  show  that  A'J  j  gf.  (3.N1.1)  says  that 
gf  is  non-positive.  Using  (3.N1.1)  in  (3.N1.3)  gives 
A^J  *  j  gf.  Thus,  A^J  is  non-positive.  Thus, 


A'J  <  ~  gf 


(3.N1.4) 


In  place  of  remark  3.2.11  we  combine  (3.N1.1)  and  (3.N1.4)  to 


Mil?!2  <  -2A  '  J 


(3.N1.5) 


At  the  end  of  the  iteration  we  will  have 


72 


max{V  .  |  <J>*  .  >  0}  =  U!  (3.N1.6) 

j  3  3 

So  U'  instead  of  U  is  the  quantity  of  interest  in  the  middle 
part  of  the  proof.  Remarks  3.2.12  and  3.2.14  still  hold  and 
they  combine  with  remarks  3.2.2  and  3.2.1  and  (3.N1.4)  to  get 

f  U ' -K ( 2  <  -4NLA2A“1A'J  (3.N1.7) 

in  place  of  remark  3.2.15.  Remarks  3.2.16-23  still  hold  with 
A'J  in  place  of  AJ,  {f(m)}'  in  place  of  (f(m)}  and  U'  in 
place  of  U.  This  proves  the  remark. 


Note  3.N.2.  This  note  describes  Gallager's  [77]  algorithm. 

It  can  be  used  to  generate  <p  in  the  routing  outline  of  this 
chapter.  In  this  algorithm  the  node  distance  is  W ^  ^  j  j 

Let  6  be  a  positive  scalar.  For  each  node  i  the  following 
steps  are  done. 


n  =  arg-mintV^j | je 


(3.N2.1) 


-  min  s  j 


t  #  —  ft  .  > 

/  if  j^n,jezi 


13 


-I 


k^n^ik 


if  j=n 


F.  •  =*  (4.  .+*..)  (T.+T.  )  -  F.  • 
13  ¥13  yi3  l  l'  13 


(3.N2.2) 

je  i  (3.N2.3) 


To  match  the  routing  outline  to  this  algorithm  we  would  have 
> p  -  i>+ii  and 


fo  if 

j=n 

j 

1.3  if 

(3.N2.4) 
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We  require  A  £  0.  With  the  following  we  will  show  that 
>.  <  3/N  satisfies  IIB2b. 


~2  2  ~+  2 
3  13  3  913  3  *3 

<  Z  .  ( d>7  -  )  2  +  (Z  .(bt.  )  2 
-  3  13  3  13 

=  Z  .  (d>“  ■ )  2  +  (Z  .  d>7  . )  2 
3  ^13'  3  13 

£  Z.  (i“  j  )  2  +  (N-l)  Z  j  (^j)2 


*  S£3  if  Vij  i  uj«ij 


(3.N2.5) 


(3.N2.6) 


Appendix  A 

Dual  of  the  Routing  Problem 


The  first  section  of  this  appendix  derives  several  condi¬ 
tions  under  which  the  flow  is  optimal.  The  second  section 

gives  a  bound  on  the  error  J(f)  -  J  .  .  All  of  the  results 

min 

given  here  can  be  found  in  the  literature.  They  are  included 
here  for  the  sake  of  completeness. 


A. 1.  Linear  Programming  Application 

In  section  3.1  the  cost  function  is  defined  on  the  set 

F  =  { f | J (f )  <  J(f°),  f  =  IkFk,  EFk  =  Rk,  Fk  >  0,  ke?}.  By 

(3.1.1)  J  is  convex  and  has  positive  partial  derivatives  on 

F.  In  this  section  it  will  be  convenient  to  assume  that 

J  .  <  j(f°).  (If  this  is  not  so  then  every  flow  in  F  is 
mm 

automatically  optimal.)  We  recast  the  routing  problem  as 
minimize  J(f') 

Such  that  J(f')  <  J(f°)  (A. 1.1) 

f'  =  lkF'k 
EF'k  =  Rk 
F'k  >  0 
k  eC 


Let  g  be  the  gradient  at  f.  Its  elements  are  positive. 

Remark  A . 1 . 2 .  If  f  is  optimal  in  (A. 1.1)  then  each  commodity 
F  is  optimal  in  the  following  problem.  If  every  commodity  F 
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is  optimal  in  the  following  problem  then  f  is  optimal  in 
(A. 1.1) . 

minimize  gF* 

such  that  EF'  =  R  .(A.  1.3) 

F*  £  0 

Proof.  Suppose  that  f  is  optimal  in  (A. 1.1).  For  any  F' 
satisfying  the  constraints  of  (A. 1.3)  there  exists  an  e  '  >0 
such  that  the  flow  f  +  e(F'-F)  satisfies  the  constraints  of 
(A. 1.1)  for  all  e,  0<e<e' .  Because  f  is  optimal  in  (A. 1.1), 


That  is,  g(F'-F)  >  0.  This  says  that  for  any  F’,  gF  <  gF'. 
Thus,  F  is  optimal  in  (A.1.3). 

Now  suppose  that  every  commodity  F  is  optimal  in  (A.1.3). 
Let  f'  be  any  object  that  satisfies  the  constraints  of 
(A. 1.1).  F'  satisfies  the  constraints  of  (A.1.3)  and  since 
F  is  optimal  there,  gF  <  gF'.  Since  J  is  convex, 

J(F’)  i  J(f)  +  g(f’-f) 

-  J(f)  +  Ej^CF^-F*) 

£  J(f)  (A. 1.5) 

Therefore,  f  is  optimal  in  (A. 1.1). 

The  dual  problem  of  (A.1.3)  is 


76 


maximize  H’R 

such  that  II'E  <  g  (A.  1.6) 

Since  Rdest  "  -£i^destRi  we  have  n'R  =  EiniRi  =  2  iciest 
(I^i^^dest)  Ri*  witil  the  translation  11^  =  n£  —  n^est  the  dual 
problem  becomes 

maximize  HR 

such  that  "  nj  -  (i,j)e£  (A. 1.7) 

^dest  *  ® 

Let  D  be  the  shortest  distance  wrt  g,  i.e. 

Ddest  =  °»  Di  =  ^ij  +  Dj>  (A.  1.8) 

Remark  A. 1 , 2 .  For  any  R,  the  shortest  distance  D  is  optimal 
in  the  dual  problem. 

Proof.  By  an  induction,  we  will  show  that  D  >  H.  Since  R^ 
is  non-negative  for  i  ?  dest  this  will  give  DR  >  DII  and  prove 
the  remark.  We  have  D^est  *  ^dest'  Now  suPPose  that 
Dj  >  Hj  for  all  j  such  that  D ^  .  Let  m  =  arg-min  {g^^j+D^}. 

Then 


Di  =  *ij  +  Dm 
2  *ij  +  nm 

i  ni  (a. 1.9) 

This  completes  the  induction. 
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“'IP'  . 
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Linear  programming  [e.g. ,  Luenberger  73]  says:  (I)  If  one  of 
the  problems,  (A. 1.3)  or  (A. 1.6),  has  an  optimal , solution 
then  both  problems  have  an  optimal  solution  with  the  same 
optimal  cost.  (II)  If  problems  (A. 1.3)  and  (A. 1.6)  have  a 
solution  with  the  same  optimal  cost  than  that  solution  is 
optimal.  This  and  the  preceding  remark  gives 

Remark  A. 1 . 3 .  F  is  optimal  in  (A. 1.3)  if  and  only  if  gF  =  DR. 

From  R  =  EF  comes  gF  =  DR  =  DEF  or  (g-DE)F  =  0.  Since 
g  -  DE  >  0,this  gives 

Remark  A. 1 . 4 .  F  is  optimal  in  (A. 1.3)  if  and  only  if  j 

Di  <  gAj  +  Dj  implies  F±;j  =  0  j 

Using  the  above  two  remarks  with  remark  A. 1.1  gives  j 

Remark  A. 1.5.  The  following  are  equivalent 

(i)  f  is  optimal  in  (A. 1.1) 

, 

(ii)  For  every  flow  f'  and  every  commodity,  gF  <  gF' 

(iii)  For  every  commodity,  gF  =  DR  j 

(iv)  For  every  commodity,  <  g—  +  Dj  implies  F^ ^  =  0. 

The  following  remark  gives  a  property  of  D. 

Remark  A. 1.6.  For  any  F'  satisfying  the  constraints  of  (A. 1.3), 

gF'  >  DR.  | 

Proof.  From  the  constraints  of  (A. 1.6)  we  have  DE  £  g.  Since 

F'  is  non-negative  DEF'  <  gF' .  Since  EF'  =  R,  the  remark 

follows.  J 


A. 2  Error  Bound 

Remark  A.  2.1.  J(f)  -  <  gf  -  DkRk 

-  mm  jc 

Proof.  Let  f*  be  the  optimal  flow.  Since  J  is  convex, 
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Jmin  =  J(f*)  *  J(f)  + 

«  J(f)  +  EkgF*k  -  gf 

=  J(£)  +  £kDkRk  -  gf  (A. 2.1) 


The  last  inequality  used  remark  A. 1.6.  In  practice  this 
bound  is  loose.  We  might  have  a  flow  whose  cost  agrees  with 
the  optimal  cost  to  four  significant  digits  but  the  error 
bound  will  confirm  only  the  first  two  digits. 


Appendix  B 
Routing  Samples 
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In  this  appendix,  four  algorithms  are  tried  on  three  dif¬ 
ferent  networks.  In  all  of  the  algorithms  the  proposed  flow 
change  was  scaled  down  to  minimize  the  network  cost  as  in 
step  III  of  the  algorithm  given  in  section  3.1.  The  four 
algorithms  were  numbered  in  the  order  of  their  computational 
complexity.  Algorithm  1  is  the  algorithm  given  in  section 
1.2.  Algorithm  2  is  Gallager's  algorithm  given  in  note  3.N.2 
We  used  3=1.  Algorithm  3  is  the  algorithm  of  section  2.3 
with  the  coefficient  of  the  quadratic  term  in  (2.3.1)  reduced 
to  1/2.  Algorithm  4  is  the  algorithm  of  section  2.2  with 
the  coefficient  of  the  quadratic  term  in  (2.2.1)  also  reduced 
to  1/2. 

The  following  is  the  link  cost  function  that  was  used. 

It  is  a  standard  mean  delay  formula  which  was  redefined  foi 
f i j  >  .999c^j  so  as  to  facilitate  the  loading  of  congested 
networks.  It  is  twice  continuously  differentiable. 


if  f.j  >  .999Cij 


(B.l) 


8C 


The  first  network  is  given  in  Figure  B.l.  Uniform  link 

]r 

capacities  were  used,  *  5.  The  input  rate  were  gen¬ 

erated  by  uniform  random  numbers  in  the  interval  [0,1]  and 
then  held  constant.  The  network  was  loaded  by  putting  the 
destination  into  am  empty  queue.  The  front  end  of  the  queue 
was  then  continually  serviced  by  checking  whether  all  of  its 
neighbors  had  been  enqueued  yet.  If  neighbor  m  had  not  been 

enqueued  then  it  was  enqueued  and  $  .  set  to  one. 

ini 

In  this  network  the  flow  branches  into  at  most  two  parts. 
Consequently,  algorithms  2  and  3  are  basically  the  same.  At 
3=2  they  would  have  performed  exactly  the  same.  Table  B.l 
showed  what  happened. 


Figure  B.l. 

Four  nodes. 

Eight  links. 

Algorithm 

Iteration 

1 

2 

3 

4 

1 

.3709 

.  3691 

.3757 

.3686 

2 

.3697 

.  3674 

.3702 

.3674 

3 

.  3681 

.3673 

.3683 

.3673 

4 

.3677 

.3676 

5 

.  3675 

.3674 

6 

.  3675 

.3673 

7 

.  3674 

8 

.  3673 

Table  B.l  Mean  Delay.  N  =  4,  L  =  8. 
Initial  mean  delay  =  . 3969 


To  see  what  would  happen  at  higher  loads  the  input  rate  was 
multiplied  by  2.5.  Table  B.2  gives  the  result.  We  stopped 
algorithm  1  in  the  tenth  iteration.  In  network  routing  it 
is  the  first  few  iterations  of  an  algorithm  that  most  inter 
ests  us. 


1 

2 

3 

4 

1 

.  8829 

.8829 

.8829 

1.3556 

2 

.  8629 

.8523 

.8341 

.9393 

3 

.8300 

.8209 

.8091 

.8123 

4 

.8174 

.8124 

.8032 

.8001 

5 

.8120 

.8047 

.8006 

.7999 

6 

.8089 

.8018 

.8001 

7 

.8070 

.8002 

.8000 

3 

.8021 

.8000 

.7999 

9 

.8012 

.7999 

10 

.8009 

Table 

B.2.  Mean  Delay. 

High 

Loading.  N  =  4 

,  L  =  8. 

Initial  mean 

delay 

=  3.8  x  105 

:  next 

network  that  was 

tried  . 

is  given  in  the 

following 

figure.  The  link  capacities  were  10.  The  input  rate  was 
determined  in  the  same  way  as  for  the  first  network. 


Figure  B.2.  Eight  nodes.  24  links. 
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With  this  network  we  examined  another  variable.  From  equation 
(3.2.4)  it  is  seen  that  sending  the  positive  node  flow 
changes  down  the  shortest  path  steepens  the  descent  direction. 

It  also  tends  to  concentrate  the  positive  flow  changes  onto 
a  few  paths ,  thus ,  increasing  the  directional  second  derivative . 
We  call  this  diversion  of  the  positive  node  flow  changes  down 
the  shortest  path  splitting.  Let  y  represent  the  routing  down 
the  shortest  path  and  let  $  be  the  routing  change  generated 
by  the  algorithm.  Without  splitting  we  have  F  =  ($+$)  (T+T)  -  F. 
With  splitting,  F  =  (<J»+$)  (T-T-)  +  i|»T+  -  F. 

This  node  flow  splitting  has  no  effect  on  algorithm  1. 

If  the  other  algorithms  used  the  splitting  then  we  called 
them  2S  or  3s  or  4S.  We  did  not  examine  this  variable  in  the 
first  network  as  those  nodes  two  hops  away  from  destination  had 
no  change  in  their  node  flow. 


Tables  B.3  and  B.4  show  the 
input  rates  were  multiplied  by  2. 


Algorithm 

Iteration 

1 

2 

2S 

1 

.2316 

.2375 

.2353 

2 

.2301 

.2323 

.2312 

3 

.2296 

.2305 

.2301 

4 

.2294 

.2298 

.2297 

5 

.2296 

.2295 

6 

.2295 

.2294 

7 

.2294 

8 

9 

10 


results 

.  In 

Table  B. 

4  the 

5. 

3 

3S 

4 

4S 

.2441 

.2423 

.2316 

.2306 

.2382 

.2365 

.2296 

.2295 

.2350 

.2336 

.2294 

.2294 

.2330 

.2320 

.2318 

.2310 

.2310 

.2305 

.2305 

.2301 

.2301 

•  22C9 

.2299 

.2297 

.2297 

.2296 

Table  B.3.  Mean  Delay,  n  *  8,  L  =  24. 
Initial  mean  delay  =  .2663 


Algorithm 

Iteration 

1 

2 

2S 

3 

3S 

4 

4S 

1 

.4760 

.  4591 

.4568 

.5526 

.  4883 

.6180 

.  5374 

2 

.  4389 

.4301 

.4309 

.4588 

.4431 

.4566 

.4444 

3 

.4359 

.4268 

.4271 

.4367 

.4299 

.4306 

.4292 

4 

.4346 

.4263 

.4263 

.4297 

.4271 

.4270 

.4260 

5 

.4335 

.4260 

.4260 

.4274 

.4263 

.4262 

.4258 

6 

.4331 

.4259 

.4259 

.4265 

.4260 

.4260 

.4258 

7 

.4326 

.4258 

.4258 

.4261 

.4259 

.4258 

.4257 

8 

.4323 

.4258 

.4258 

.4259 

.4258 

.4258 

9 

.4  321 

.4257 

.4259 

.4258 

.4257 

.4257 

10 

.4319 

.4258 

Table  B. 4 

.  Mean 

Delay. 

N  =  8 

,  L  =  24 

Initial  mean 

delay  - 

2.5  x 

106 

For 

the  final 

network 

we  took 

.  the  following 

abstraction 

of  an  ARPANET  topology  [Kleinrock 

76,  p. 

308]  . 

We  set 

the 

link  capacities  to  10  and  since  the  topology  was  non-symmetric 
we  let  the  input  rate  R*  =  .2.  Table  B.5  gives  the  result. 


1 

2 

2S 

3 

3S 

4 

4S 

1 

.9546 

.9524 

.9526 

.  9508 

.9515 

.9500 

.9509 

2  - 

.9417 

.  9332 

.9384 

.9375 

.9378 

.  9362 

.9364 

3 

.  9400 

.9125 

.9159 

.9161 

.9156 

.9270 

.9248 

4 

.9345 

.9108 

.9125 

.  9136 

.9130 

.9224 

.9197 

5 

.9312 

.9104 

.9113 

.9127 

.9116 

.9135 

.9119 

6 

.9261 

.9102 

.9108 

.9120 

.9110 

.9122 

.9112 

7 

.9233 

.9101 

.9106 

.9116 

.9105 

.9105 

.9107 

8 

.9218 

.9100 

.9104 

.9112 

.9102 

.9103 

.9105 

9 

.9206 

.9099 

.9102 

.9109 

.9099 

.9102 

.9103 

10 

.9191 

.9099 

.9101 

.9107 

.  9098 

.9101 

.9102 

Table  B.5.  Mean  Delay.  N  =  14,  L  *  42. 
Initial  mean  delay  =  1.2045 


In  the  third  network  there  were  no  3-way  flow  branching  and 
about  17  2-way  flow  branchings. 

The  strongest  suggestion  that  these  tables  make  is  that 
algorithm  1  should  not  be  used.  In  tables  B.4  and  B.5  what 
algorithm  1  reached  in  ten  iterations  most  of  the  other  algor¬ 
ithms  reached  in  three  iterations.  Splitting  the  node  flow  is 
of  less  significance  than  the  difference  between  algorithms  2 
and  3  which  itself  is  not  big. 
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